I.  INTRODUCTION

Bone  health  is  critical  for  optimal  performance  and  the  prevention  of  fractures  associated 
with  low  bone  mineral  density  (BMD).  Our  first  two-year  period  of  funding  focused  on 
using the meta-analytic approach to examine the effects of exercise on BMD in adult humans
using summary means from completed studies. Since no meta-analysis had existed using
individual patient data (IPD) to examine the effects of exercise on BMD, our second period
of  funding  was  devoted  to  examining  the  feasibility  of  such.  The  specific  aims  of  the  second 
project  period  were  to  (1)  compare  summary  versus  IPD  in  relation  to  the  overall  magnitude 
of  effect  that  exercise  has  on  BMD,  (2)  compare  summary  versus  IPD  in  relation  to  the  effect 
of  potentially  confounding  variables  (age,  training  program,  etc.)  on  changes  in  BMD,  and 
(3)  provide  recommendations  for  future  research  regarding  the  use  of  summary  versus  IPD 
for  examining  the  effects  of  exercise  on  BMD.  The  results  of  this  project  will  help  identify 
the  best  approach  to  use  (summary  versus  IPD)  when  attempting  to  arrive  at  a  more  objective 
conclusion  regarding  the  effects  of  exercise  on  BMD  in  humans.  In  addition,  this  will  be  the 
first meta-analysis using IPD in the area of exercise and BMD. Finally, the results of this
project  will  provide  the  Armed  Forces  with  a  better  understanding  of  the  effects  of  exercise 
on  BMD  and  will  also  help  to  identify  what  programs,  if  any,  will  provide  for  optimum  bone 
development  and  maintenance. 

II.  BODY 

A.  Statement  of  Work 

None.  No  work  was  conducted  for  the  past  year  because  this  grant  has  not  yet  been 
transferred  from  Massachusetts  General  Hospital  in  Boston  to  West  Virginia  University 
in  Morgantown. 

III.  KEY RESEARCH ACCOMPLISHMENTS

None.  No  work  was  conducted  for  the  past  year  because  this  grant  has  not  yet  been 
transferred  from  Massachusetts  General  Hospital  in  Boston  to  West  Virginia  University  in 
Morgantown. 

IV.  REPORTABLE OUTCOMES (Note: The items referenced below represent work
which was conducted prior to the current reporting year and previously listed as "in
press")

A.  Manuscripts  (Refereed) 

1.  Kelley, G.A., Kelley, K.S., Tran, Z.V. Aerobic exercise and regional bone density
in women: A meta-analysis of controlled trials. American Journal of Medicine &
Sports. 2002; 4:427-433. (See Appendix A)

2.  Kelley, G.A., Kelley, K.S., Tran, Z.V. Retrieval of individual patient data for an
exercise-related meta-analysis. American Journal of Medicine and Sports. 2002;
4:350-354. (See Appendix B)

3.  Kelley,  G.A.,  Kelley,  K.S.,  Tran,  Z.V.  Exercise  and  lumbar  spine  bone  mineral 
density  in  postmenopausal  women:  A  meta-analysis  of  individual  patient  data. 
Journal  of  Gerontology.  Medical  Sciences.  2002;  57A:M599-M604.  (See  Appendix 
C) 

4.  Kelley,  G.A.,  Kelley,  K.S.,  Tran,  Z.V.  Exercise  and  bone  mineral  density  at  the 
femoral  neck  in  postmenopausal  women:  A  meta-analysis  of  controlled  clinical 
trials  using  individual  patient  data.  Disease  Management  and  Clinical  Outcomes. 
Note:  This  manuscript,  previously  reported  as  “in  press,”  will  not  be  published  in 
the  above-referenced  journal  because  it  has  ceased  publication.  We  plan  on  seeking 
publication  of  this  work  elsewhere.  For  verification  regarding  this  journal,  please 
contact Dr. Clifton Lacy (Phone: 609-292-7837; E-mail: lacvl@optonline.net). (See
Appendix D)

V.  CONCLUSIONS 

A.  Importance  of  Completed  Research 

No  work  was  conducted  for  the  past  year  because  this  grant  has  not  yet  been  transferred 
from  Massachusetts  General  Hospital  in  Boston  to  West  Virginia  University  in 
Morgantown. 

B.  Suggestions  for  Future  Work 

A need exists for addressing the inconsistencies between results of summary means and
individual patient data in relation to the effects of exercise at the hip. This includes an
examination of whether the smaller number of individual patient data studies available
at the femoral neck limited the power of our more recent meta-analysis or whether the
inclusion of studies using any femur site skewed the results of our earlier meta-analytic work.

C.  So  What? 

This  cannot  be  addressed  because  no  work  was  conducted  for  the  past  year  on  this 
project.  We  are  still  awaiting  transfer  of  this  grant  from  Massachusetts  General 
Hospital  in  Boston  to  West  Virginia  University  in  Morgantown. 

VI.  REFERENCES - Not Applicable

VII.  APPENDICES

A.  Aerobic  Exercise  and  Regional  Bone  Density  Manuscript 

B.  Retrieval  of  Individual  Patient  Data  Manuscript 


C.  Exercise  and  Lumbar  Spine  Bone  Mineral  Density  Manuscript 

D.  Exercise  and  Bone  Mineral  Density  at  the  Femoral  Neck  Manuscript 
Aerobic Exercise and Regional Bone Density in
Women: A Meta-Analysis of Controlled Trials


George  A.  Kelley,  DA;  Kristi  S.  Kelley,  MEd 

Institute  of  Health  Professions,  Massachusetts  General  Hospital,  Boston,  MA 


In  this  study  the  meta-analytic  approach  was 
used  to  examine  the  effects  of  aerobic  exercise 
on  regional  bone  mineral  density  at  the 
lumbar  spine,  femur,  and  radius  in  women. 
Twenty-four studies representing 58 groups
(31 exercise, 27 control) and 1029 subjects
(517 exercise, 512 control) met the criteria
for inclusion. Using a random-effects model,
small but statistically significant effect size
changes in bone mineral density were observed
at the lumbar spine (X±SD=0.33±0.49;
95% confidence interval=0.16-0.50) and
femur (X±SD=0.25±0.35; 95% confidence
interval=0.14-0.35). Changes in lumbar
spine bone mineral density were equivalent to
a 0.37% increase in the exercise groups and
a 1.87% decrease in the control groups. For
the femur, changes were equivalent to a
1.37% increase in the exercise groups and a
0.58% decrease in the control groups. No
statistically significant changes were observed
at the radius (X±SD=0.10±0.45; 95% confidence
interval=-0.20 to 0.41). The overall
results of this study suggest that aerobic exercise
has a small but positive effect on bone
mineral density at the lumbar spine and
femur in women. (Am J Med Sports. 2002;
4:427-433, 452) ©2002 Le Jacq Communications, Inc.

Osteoporosis, defined as abnormally low bone mass, is a major public health problem in the
United States, as well as other countries. In 1996, it was estimated that approximately
23 million women in the United States had osteoporosis or were at risk for developing the
disease.1 By the year 2015 this figure is expected to increase to approximately 35 million.2
It is well established that low bone mineral density (BMD) is associated with increased
fracture risk. The health care costs associated with osteoporotic fractures have been
reported to exceed $13.8 billion annually.3 Given the health and economic costs associated
with osteoporosis, a need exists for appropriate nonpharmacologic and pharmacologic
interventions for dealing with this disease. One such nonpharmacologic intervention may be
aerobic exercise,4 a low-cost intervention that is available to most of the general public.

We have previously reported that aerobic exercise might help to maintain and/or increase
BMD in postmenopausal women but that additional studies were needed before any firm
conclusions could be reached.5-7 Since the time of these published meta-analyses, a number
of additional studies have been conducted and/or located. It is critical that up-to-date
meta-analyses be performed in order to provide the most recent information possible on the
state of knowledge regarding the topic of interest. Given the health care consequences of
low BMD, it is important to understand the role that aerobic exercise may play as a
nonpharmacologic intervention for enhancing and/or maintaining BMD in women. Thus, we used
the meta-analytic approach to examine the effects of aerobic exercise on regional BMD at
the lumbar spine, femur, and radius in women.


DATA SOURCES. Computerized literature searches of articles indexed between January, 1966
and December, 1998 were performed using MEDLINE, Embase, Current Contents, Sport Discus,
and Dissertation Abstracts International Databases. The key words used in this literature
search were "exercise" and "bone." While this broad approach to searching the literature
results in the retrieval of a greater number of articles to review, it should decrease the
number of studies that would be missed by a narrower, more focused search. In addition to
computerized literature searches, the reference lists from both original and review
articles were examined in order to identify any studies that had not been previously
identified and that appeared to contain information that may have met our inclusion
criteria. Finally, three experts on exercise and bone density (Dr. David Nichols,
Dr. Charlotte Sanborn, and Dr. Christine Snow) reviewed our reference list for thoroughness
and completeness.

STUDY SELECTION. The inclusion criteria for this study were as follows: 1) trials were
randomized or nonrandomized and included a comparative nonexercise group; 2) aerobic
exercise was the only intervention; 3) subjects were adult female humans (mean age, 18 years
or older); 4) studies were reported as journal articles, dissertations, and master's theses
published in the English language literature; 5) studies were published and indexed between
January, 1966 and December, 1998; 6) BMD (relative value of bone mineral per measured bone
area) was assessed at the femur, lumbar spine, or radius; and 7) training lasted a minimum
of 16 weeks. Only studies that met the above criteria were included in our analysis. Thus,
for example, if BMD was also assessed in women performing progressive resistance exercise
as the primary training modality, we did not include this information since it did not meet
our inclusion criteria. Because dissertations and master's theses may eventually become
full-length journal articles, we cross-referenced between the two in order to avoid
duplication. We did not include abstracts and conference papers from national meetings
because of the paucity of data provided as well as the inability to obtain complete data
from the authors. Studies published in foreign language journals were also not included
because of the potential error in the translation and interpretation of findings. Studies
that met our inclusion criteria were also examined to ensure that the same subjects were
not included in more than one study.8 For studies that met our inclusion criteria but did
not provide appropriate information on changes in BMD, personal contact was made with the
authors in an attempt to retrieve such information.

DATA ABSTRACTION. Coding sheets that could hold 242 items per study were developed and
utilized in this study. In order to avoid inter-coder bias, all data were independently
abstracted by both authors. The authors then met and reviewed every data point for accuracy
and consistency. Disagreements were resolved by consensus. The major categories of variables
coded included study characteristics, physical characteristics of subjects, and primary and
secondary outcomes.

STATISTICAL ANALYSIS. Primary Outcomes. The primary outcomes in this study were changes in
BMD at the lumbar spine, femur, and radius, calculated using the standardized difference
effect size (ES) approach. This was accomplished by subtracting the change outcome in the
control group from the change outcome in the exercise group, then dividing this difference
by the pooled standard deviation of the exercise and control groups.9 This measure provides
one with a statistic similar to a z score. In general, an ES of 0.20 is considered a small
effect, 0.50 a moderate effect, and 0.80 a large effect.10 An ES of 0.30, for example, means
that the exercise group differed from the control group by three-tenths of a standard
deviation in favor of the exercise group. Using a z score table, this means that the average
subject in the exercise group would do better than approximately 62% of the control group.
We used this approach vs. the original metric because of the various ways in which the
authors reported data on changes in BMD and because we also wanted to maximize the number
of studies and outcomes that could be included in our analysis. All ESs were then corrected
for small-sample bias.9 For studies that did not report change outcome variances, these
were estimated using previously developed methods.11 T-distribution 95% confidence intervals
(CI) were calculated for all outcomes. If the 95% CI included zero (0.00), it was concluded
that there was no statistically significant effect of exercise on BMD. A random-effects
model was used for all analyses.9
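
The standardized difference calculation described in this paragraph can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the software used in the study: the function names and the input numbers are hypothetical, the sign is taken as exercise minus control so that a positive ES favors exercise, and the small-sample correction uses the common approximation 1 - 3/(4df - 1), which the text itself does not spell out.

```python
import math
from statistics import NormalDist


def pooled_sd(sd_ex, n_ex, sd_ctl, n_ctl):
    """Pooled standard deviation of the two groups' change scores."""
    num = (n_ex - 1) * sd_ex ** 2 + (n_ctl - 1) * sd_ctl ** 2
    return math.sqrt(num / (n_ex + n_ctl - 2))


def standardized_es(change_ex, change_ctl, sd_ex, n_ex, sd_ctl, n_ctl):
    """Standardized difference ES (exercise minus control),
    multiplied by an approximate small-sample correction factor."""
    d = (change_ex - change_ctl) / pooled_sd(sd_ex, n_ex, sd_ctl, n_ctl)
    df = n_ex + n_ctl - 2
    correction = 1 - 3 / (4 * df - 1)  # assumed form of the bias correction
    return d * correction


def percent_of_controls_exceeded(es):
    """Percentage of a normal control distribution lying below an ES."""
    return 100 * NormalDist().cdf(es)


# Hypothetical change scores in g/cm^2, for illustration only.
es = standardized_es(change_ex=0.004, change_ctl=-0.010,
                     sd_ex=0.03, n_ex=17, sd_ctl=0.03, n_ctl=19)
print(round(es, 3))
print(round(percent_of_controls_exceeded(0.30)))
```

With these made-up inputs the corrected ES is about 0.46, and an ES of 0.30 corresponds to roughly the 62nd percentile of the control distribution, which agrees with the figure quoted in the text.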

Heterogeneity of ESs was examined using the Q statistic.9 For studies that included
multiple outcomes because of more than one group, net changes were initially treated as
independent data points. However, in order to examine the influence (sensitivity) of each
study on the overall results, analyses were performed with each study deleted from the
model for ES changes at the lumbar spine, femur, and radius. Publication bias (the tendency
for journals and/or authors to publish studies that yield statistically significant results)
was examined using a funnel plot.12 This was accomplished by plotting the sample size on the
vertical axis and ES changes in BMD on the horizontal axis. Usually, smaller studies tend to
disperse at the bottom of the funnel while larger studies tend to congregate at the top. A
gap at the bottom of the funnel on the left side indicates that small studies yielding null
or negative results may be missing. Study quality was assessed using a three-item
questionnaire designed to assess bias, specifically randomization, blinding, and
withdrawals/dropouts.13 The number of points possible ranged from a low of 0 to a high of 5.
All questions were designed to elicit yes (1 point) or no (0 points) responses. The
questionnaire, which took less than 10 minutes per study, has been shown to be both valid
(face validity) and reliable (researcher inter-rater agreement: r=0.77; 95% CI=0.60-0.86).13
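
The heterogeneity and sensitivity steps above can be illustrated with a short Python sketch. Several things here are assumptions rather than details taken from the study: the text does not say which random-effects estimator was used, so the DerSimonian-Laird between-study variance is swapped in as one common choice; the function names and the effect sizes are hypothetical; and for simplicity the loop drops one effect size at a time, whereas the study deleted one study (which may contribute several effect sizes) at a time.

```python
def pooled_es(es, var, tau2=0.0):
    """Inverse-variance weighted mean; tau2 > 0 gives random-effects weights."""
    w = [1 / (v + tau2) for v in var]
    return sum(wi * e for wi, e in zip(w, es)) / sum(w)


def q_statistic(es, var):
    """Q: weighted sum of squared deviations from the fixed-effect mean."""
    mean = pooled_es(es, var)
    return sum((e - mean) ** 2 / v for e, v in zip(es, var))


def dl_tau2(es, var):
    """DerSimonian-Laird between-study variance (assumed estimator)."""
    w = [1 / v for v in var]
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    return max(0.0, (q_statistic(es, var) - (len(es) - 1)) / c)


def leave_one_out(es, var):
    """Random-effects pooled ES with each effect size deleted once."""
    results = []
    for i in range(len(es)):
        sub_es = es[:i] + es[i + 1:]
        sub_var = var[:i] + var[i + 1:]
        results.append(pooled_es(sub_es, sub_var, dl_tau2(sub_es, sub_var)))
    return results


# Hypothetical effect sizes and variances, for illustration only.
es = [0.10, 0.30, 0.50, 0.20]
var = [0.04, 0.05, 0.08, 0.02]
print(q_statistic(es, var))
print(min(leave_one_out(es, var)), max(leave_one_out(es, var)))
```

The minimum and maximum of the leave-one-out results play the same role as the "low" and "high" values reported for each site in the results.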

Subgroup Analyses. Subgroup analyses for ES changes at the lumbar spine and femur were
performed using analysis of variance (ANOVA)-like procedures for meta-analysis.9 These
procedures provide statistics for both within (Qw)- and between (Qb)-group differences. A
random-effects model was used for all analyses. Subgroup analyses were performed for: ES
changes at the lumbar spine and femur according to type of publication (journal vs.
dissertation); country in which the study was conducted (United States vs. other); study
design (randomized vs. nonrandomized, controlled trial); whether subjects were
postmenopausal; whether subjects were taking calcium supplementation; type of BMD assessment
(dual-energy x-ray absorptiometry, dual photon absorptiometry, quantitative computed
tomography); and higher vs. lower impact activity. Higher impact activities included
exercises such as running, jumping, and aerobic dancing with both feet off the ground, while
lower impact activities included exercises such as walking and low impact aerobic dancing
with both feet on the ground. ES changes in BMD at the femur were also examined when data
were partitioned according to whether drugs were taken that could enhance BMD, cigarette
smoking, diet, previous physical activity, and the specific site of BMD assessment (femoral
neck, trochanter, Ward's triangle, intertrochanter). Insufficient data were provided to
examine ES changes in BMD at the lumbar spine according to whether drugs were taken that
could enhance BMD, cigarette smoking, diet, previous physical activity, and the specific
site at which BMD was assessed. For both the lumbar spine and femur, insufficient data were
provided to examine changes in BMD when partitioned according to alcohol consumption and
previous fractures. We were unable to partition the results according to training modality
because of the variety of activities in which the subjects participated. We did not perform
subgroup analysis for changes in BMD at the radius because of the small sample size. In
addition, we were not able to examine differences between the radius and other sites at the
forearm (for example, the ulna) because of insufficient data.
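
The ANOVA-like partition mentioned above splits the total heterogeneity into a within-group part (Qw) and a between-group part (Qb). The Python sketch below is a simplified illustration with hypothetical function names and data: it uses plain inverse-variance weights, whereas the study reports a random-effects model (one would add a between-study variance to each variance before calling these functions), and the p-value helper only covers the two-group case, where Qb is referred to a chi-square distribution with 1 degree of freedom.

```python
import math


def weighted_mean(es, var):
    """Inverse-variance weighted mean effect size."""
    w = [1 / v for v in var]
    return sum(wi * e for wi, e in zip(w, es)) / sum(w)


def q_within(es, var):
    """Heterogeneity of a set of effect sizes around its own weighted mean."""
    m = weighted_mean(es, var)
    return sum((e - m) ** 2 / v for e, v in zip(es, var))


def q_between(groups):
    """Qb = Q_total - sum of Qw; groups maps a label to (es list, var list)."""
    all_es = [e for es, _ in groups.values() for e in es]
    all_var = [v for _, var in groups.values() for v in var]
    q_total = q_within(all_es, all_var)
    qw = sum(q_within(es, var) for es, var in groups.values())
    return max(0.0, q_total - qw)


def p_value_df1(q):
    """Upper-tail chi-square probability for 1 degree of freedom."""
    return math.erfc(math.sqrt(q / 2))


# Hypothetical subgroups, for illustration only.
groups = {
    "calcium": ([0.5, 0.5], [0.1, 0.1]),
    "no calcium": ([0.1, 0.1], [0.1, 0.1]),
}
print(q_between(groups))
print(round(p_value_df1(4.55), 3))
```

As a check on the helper, a between-group statistic of 4.55 on 1 degree of freedom gives a p-value of about 0.03, the same order as the calcium supplementation result reported later in the article.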

Regression Analysis. Potential associations between ES changes in BMD at the lumbar spine
and femur and the variables listed below were examined using simple weighted least-squares
regression, according to procedures developed by Hedges and Olkin.9 Variables included study
quality, percent dropout, initial BMD, age, height, initial body weight, changes in body
weight, initial body mass index, changes in body mass index, initial percent fat, changes in
percent fat, initial lean body mass, changes in lean body mass, initial maximum oxygen
consumption (mL·kg⁻¹·min⁻¹), changes in maximum oxygen consumption (mL·kg⁻¹·min⁻¹), years
past menopause, initial calcium intake, changes in calcium intake, reliability of BMD
measurements, length, frequency, intensity, and duration of training, total minutes of
training (length x frequency x duration), and compliance, defined as the percentage of
exercise sessions attended. Insufficient data were available to examine ES changes in BMD
and resting heart rate. We did not conduct regression analyses for ES changes in BMD at the
radius because of the small sample size. We were unable to conduct any type of multiple
regression analyses because of missing data for different sets of variables.
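
A simple weighted least-squares regression of ES on a single covariate can be sketched as follows. This is only an illustration of the general technique with hypothetical names and data; the weights are assumed to be inverse variances of the effect sizes, and the sketch does not reproduce the standard-error adjustments that meta-regression procedures apply.

```python
def weighted_simple_regression(x, y, w):
    """Return (intercept, slope) minimizing sum of w * (y - a - b*x)^2."""
    sw = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    y_bar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - x_bar) * (yi - y_bar) for wi, xi, yi in zip(w, x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


# Hypothetical data: training length in weeks vs. ES, weights = 1/variance.
weeks = [16, 24, 52, 104]
es = [0.132, 0.148, 0.204, 0.308]
weights = [10, 20, 5, 8]
intercept, slope = weighted_simple_regression(weeks, es, weights)
print(round(intercept, 3), round(slope, 4))
```

Because the made-up ES values lie exactly on the line ES = 0.10 + 0.002 x weeks, the fit recovers that intercept and slope regardless of the weights; with real data the weights determine how strongly each study pulls the line.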

Secondary Outcomes. Secondary outcomes (changes in body weight, body mass index, percent
body fat, lean body mass, maximum oxygen consumption, resting heart rate, and calcium
intake) were calculated as the difference (exercise minus control) of the changes (initial
minus final) in these mean values. With the exception of the use of the original metric vs.
standardized difference approach, changes in secondary outcomes were examined using the same
procedures as those for BMD.

An independent t test (2-tailed) was used to compare differences in study quality between
journals and dissertations. Unless otherwise noted, all results are reported as X±SD. The
alpha level for statistical significance was set at p<0.05.


STUDY CHARACTERISTICS. Twenty-seven studies met the criteria for inclusion14-40; however,
we were unable to retrieve necessary data from three studies.15,27,28 This resulted in a
loss of approximately 11%. Thus, 24 studies representing 31 exercise and 27 control groups
(some studies had more than one group) were included in our final analysis.14,16-26,29-40
From these 24 studies, 31 effect sizes were generated for the lumbar spine, 42 for the
femur, and 11 for the radius. Twenty-two of the studies were published in refereed
journals14,17-26,29,31-40 while the other two were dissertations.16,30 Thirteen studies were
conducted in the United States,14,16,19,20,22,26,30,31,33,34,37-39 three each in
Australia18,35,36 and the United Kingdom,17,21,32 two each in Finland24,25 and Japan,23,40
and one in China.29 Thirteen of the studies were randomized, controlled
trials,16,17,21-25,29,31,32,35,36,39 while 11 were nonrandomized, controlled
trials.14,18-20,26,30,33,34,37,38,40 Study quality ranged from 0 to 5 (X±SD=1.75±1.51).
There was no statistically significant difference in study quality between studies
published in journals and dissertations (p=0.65). A total of 1029 subjects (517 exercise,
512 control) completed pre- and post-assessments of BMD. The average number of subjects
ranged from five to 49 in the exercise groups (X±SD=17±12) and from four to 48 in the
control groups (X±SD=19±15). The percent dropout, defined as the percentage of subjects who
did not complete the study, ranged from 0%-63% in the exercise groups (X±SD=20%±16%) and
from 0%-43% in the control groups (X±SD=10%±11%).

SUBJECT CHARACTERISTICS. A description of the subject characteristics is shown in Table I.
In six studies, all of the subjects were white14,22,30,31,33,36; in one study, all subjects
with the exception of one (a black person) were white20; in one study, all subjects were
Chinese29; and in one study, all subjects were Japanese.40 In 19 studies, all subjects were
postmenopausal14,17-23,26,29-33,35-38,40; in two studies, only some subjects were
postmenopausal25,34; and in three studies, no subjects were postmenopausal.16,24,39 In 14
studies, no subjects were taking any type of hormone replacement during the
study14,17-19,23,30-36,38,40 and in six studies, some of the subjects were taking some type
of hormone replacement therapy.20-22,24,25,37 One study had two separate groups of subjects
in which one group took some type of hormone replacement therapy while the other did not.26
In nine studies, all subjects were taking some type of calcium supplementation during the
study18,20,26,31-33,35,37,39; in five studies, no subjects were taking any type of calcium
supplementation19,22,23,36,40; and in two studies, some of the subjects took some type of
calcium supplementation.21,30 Another study had two separate groups of subjects, one who
took some type of calcium supplementation and another who did not.29 In one other study,
all of the subjects in the control group took some type of calcium supplementation, while
some in the exercise group did so.14 In eight studies, food intake did not change during the
study14,17,19,22,24,26,32,38 and in one study, it did.32 In six studies, none of the
subjects smoked cigarettes24-26,30,36,38 and in four studies, some of the subjects
smoked.17,19,21,37 In one study, none of the subjects in the control group smoked but some
of the subjects in one of the two exercise groups smoked.31 In another study, some of the
subjects in the exercise group smoked but no subjects in the control group did.18 In two
studies, some subjects consumed alcohol during the study.18,32 In two studies, no subjects
had previous fractures,29,38 while in another study, subjects did have previous
fractures.21 In 13 studies, none of the subjects had been active prior to taking part in the
study14,17,20,22,24-26,30,31,33,37,38,40 and in six studies, some of the subjects had been
previously active.16,19,21,35,36,39 In one study, no subjects in the control group had been
active prior to taking part in the study but subjects in the exercise group had been
previously active.34

Table I. Subject Characteristics

Variable                            N    Exercise (X±SD)    N    Control (X±SD)
Age (years)                         31   57.9±12.7          27   58.2±13.2
Height (cm)                         22   160.7±4.3          19   161.5±4.4
Weight (kg)                         25   64.7±6.6           21   64.2±6.4
BMI (kg/m2)                         24   24.9±1.9           21   24.6±1.9
Fat (%)                             13   38.2±4.8           10   37.9±6.5
Lean mass (kg)                      13   41.2±3.5           10   39.8±2.8
Initial VO2max (mL·kg⁻¹·min⁻¹)      16   23.4±4.2           11   23.9±5.0
Initial RHR (bpm)                   4    76.7±3.7           2    74.15±4.5
Postmenopausal (years)              22   10.0±5.4           18   11.7±5.8
Calcium (mg)                        19   934±340            16   938±344

N=number of groups reporting mean data; BMI=body mass index; VO2max=maximal oxygen
consumption; RHR=resting heart rate; bpm=beats per minute.

BONE DENSITY ASSESSMENT CHARACTERISTICS. Twelve studies assessed BMD at the lumbar spine
using dual-energy x-ray absorptiometry (DEXA),16,17,21,23-26,29,35,38-40 seven studies used
dual-photon absorptiometry (DPA),14,18,20,22,30,31,34 and two used quantitative computed
tomography (QCT).19,32 One other study used both DPA and QCT to assess BMD at the lumbar
spine.33 For studies that included such data, the vast majority reported the assessment of
BMD at the L2-L4 sites.16,17,20,22-25,29-31,34,38,39 Three studies reported the assessment
of BMD at the L1-L4 sites,14,35,40 one at the L1-L2 sites,19 and another at the L1-L3 and
L2-L4 sites.33 Between-study mean reliability (coefficient of variation) of BMD assessment
at the lumbar spine ranged from 0.4%-3%. Ten studies used DEXA to assess BMD at the
femur,16,17,21,24-26,29,35,38,39 while another five used DPA.14,18,30,33,34 Fifteen studies
included assessment of BMD at the femoral neck,14,16-18,21,24-26,29,30,33-35,38,39 seven at
Ward's triangle,16,18,26,29,34,38,39 eight at the trochanter,16,18,24,26,34,35,38,39 and two
at the intertrochanter.29,35 One study involved BMD assessment at the distal femur,24 and
another involved assessment of the total femur.35 The mean between-study reliability
(coefficient of variation) for BMD assessment at the femur ranged from 0.5%-4.4%. In eight
studies, BMD was assessed at the forearm24-26,30,33,34,36,37; however, we were unable to
identify whether one of the studies assessed BMD at the radius.36 Four studies used
single-photon absorptiometry (SPA) to assess BMD at the radius,30,33,34,37 while three used
DEXA.24-26 The mean between-study reliability (coefficient of variation) ranged from
0.5%-5.0%.

TRAINING PROGRAM CHARACTERISTICS. A description of the training program characteristics is
shown in Table II. Overall, the activity most commonly included in these exercise
interventions was walking. Specifically, five studies limited the training modality to
primarily walking,17,19,21,23,33 two to jogging,16,39 and two to a combination of walking
and jogging.31,38 Two other studies had subjects participate primarily in aerobic
dancing,32,34 while another two employed walking35,36 or aerobic dancing18,22 as well as
other activities. One study limited participants' exercise to stair stepping and other
miscellaneous activities,29 while another limited exercise to stationary cycling. Two other
studies had participants take part in a combination of walking, jogging, cycling, stair
stepping, and other activities20,25; one had subjects perform walking, jogging, and stair
stepping26; and another had subjects walk, swim, and perform other various activities.40
One study had subjects perform aerobic dancing, stair stepping, and other assorted
activities,24 while another had subjects perform a variety of different but unspecified
activities.32 Finally, one study had one group of subjects who walked and another group who
swam.30

PRIMARY OUTCOMES. Lumbar Spine. The overall results for ES changes in lumbar spine BMD are
shown in Table III. As can be seen, small but statistically significant ES changes in lumbar
spine BMD were observed. These changes were equivalent to a 0.37% increase in the exercise
groups and a 1.87% decrease in the control groups. No statistically significant
heterogeneity was found for changes in lumbar spine BMD. Funnel plot analysis was suggestive
of publication bias. With each study deleted from the model once, ES changes in BMD ranged
from a low of 0.27±0.42 (95% CI=0.12-0.44) to a high of 0.36±0.48 (95% CI=0.18-0.54).

Femur. The overall results for ES changes in BMD at the femur are shown in Table III. As can
be seen, small but statistically significant changes in BMD at the femur were observed.
These changes were equivalent to a 1.37% increase in the exercise groups and a 0.58%
decrease in the control groups. No statistically significant heterogeneity was found for
changes in BMD at the femur. Funnel plot analysis was suggestive of publication bias. With
each study deleted from the model once, ES changes in BMD at the femur ranged from a low of
0.21±0.34 (95% CI=0.10-0.32) to a high of 0.26±0.38 (95% CI=0.14-0.38).


Table II. Training Program Characteristics

Variable                    N    (X±SD)
Length (weeks)              31   53±23
Frequency (days/week)       28   3±1
Intensity (% VO2max)        7    75±8
Duration (min/session)      22   33±11
Total min*                  22   5046±3159
Compliance (%)              21   83±12

N=number of groups reporting mean data; VO2max=maximal oxygen consumption; *total minutes
calculated as the product of length, frequency, and duration.


Radius. The overall results for ES changes in BMD at
the radius are shown in Table III. As can be seen,
changes in BMD at the radius were not statistically
significant. ES changes were equivalent to a 0.08%
decrease in BMD for the exercise groups and a 0.75%
decrease in the control groups. No statistically significant
heterogeneity was found for changes in BMD at
the radius. Funnel plot analysis was not suggestive of
publication bias. With each study deleted from the
model once, ES changes in BMD at the radius ranged
from a low of 0.02±0.37 (95% CI=-0.25 to 0.28) to a
high of 0.17±0.42 (95% CI=-0.13 to 0.48).

Subgroup and Regression Analysis. Greater ES changes in
BMD at the femur were observed for those subjects
who received some type of calcium supplementation
(X±SD, calcium supplementation=0.33±0.42; no calcium
supplementation, -0.24±0.44; Qb=4.55; p=0.03).
None of the other subgroup analyses at the lumbar
spine and femur were statistically significant or clinically
important.

SECONDARY OUTCOMES. A statistically significant
increase was observed for changes in maximum oxygen
consumption (X±SD=1.86±2.17 mL·kg⁻¹·min⁻¹;
95% CI=0.31-3.41). No statistically significant or clinically
important changes were found for any of the
other secondary outcomes.

Discussion 

One of the primary roles of a meta-analysis is to attempt
to arrive at some overall conclusion(s) regarding
a particular body of research. The overall results
of this study suggest that aerobic exercise has a small
but positive effect on BMD at the lumbar spine and
femur in both premenopausal and postmenopausal
women, and that this effect appears to be the result
of increasing and/or preserving BMD. The fact that a
similar effect was not found at the radius is not surprising,
given that it appeared that all of the exercise
interventions that the studies employed focused on
Variable        ES(#)   X±SD        95% CI           Q(p)
Lumbar spine    31      0.33±0.49   0.16 to 0.50*    33.65 (0.29)
Femur           42      0.25±0.35   0.14 to 0.35*    32.93 (0.81)
Radius          10      0.10±0.45   -0.20 to 0.41    9.99 (0.44)

BMD=bone mineral density; CI=confidence interval.
* Significantly different from zero.

loading the lower extremities. Thus, specific loading
of a site, including the radius, may be necessary in
order to help increase and/or preserve BMD at that
particular  site.  The  overall  results  observed  in  this 
study  are  similar  to  those  of  our  previous  and  less 
complete  work,  in  which  comparable  changes  in 
BMD  were  reported.s-^ 

While the results of this study are positive with
respect to changes in BMD at the lumbar spine and
femur, the clinical importance of such small
changes (approximately 2%) is not known, especially
as they relate to fracture risk. Indeed, it may be
that postmenopausal women might need other
types of nonpharmacologic and pharmacologic interventions
in addition to, or in lieu of, aerobic exercise
in order to realize a significant impact on
increasing and/or preserving BMD and subsequently
reducing fracture risk. For example, a recent
meta-analysis found that 10 mg per day of alendronate
over a period of 3 years in postmenopausal,
osteoporotic women reduced the estimated cumulative
incidence of nonvertebral fractures from 12.6%
in the placebo group to 9.0% in the alendronate
group.41 This coincided with an increase in BMD
of approximately 8.8% at the spine, 7.8% at the
trochanter, and 5.9% at the femoral neck.41 Since
the changes in BMD observed in this meta-analysis
were much smaller, it is difficult to generalize as to
how these changes impact subsequent fracture risk.
It would appear plausible to suggest that future
studies examining the effects of exercise on changes
in BMD attempt to address the clinical importance
of these changes on subsequent fracture risk.

The fact that there were greater changes in BMD
at the femur in those studies that included calcium
supplementation suggests that its combination with
exercise may be necessary in order to increase
and/or preserve BMD in women. This supports previous
work in which calcium supplementation was
found to be necessary in order to maximize the benefits
of exercise on BMD.42

We were surprised to find that both higher and
lower impact activity yielded similar benefits at both
the femur and lumbar spine, especially since it is generally
believed that higher impact activity will have a
more positive effect on BMD. However, our results
support other reports of similar BMD results for both
higher and lower impact activities.22 This notwithstanding,
our results need to be interpreted with caution,
since the issue of mechanical loading and
skeletal integrity is still a controversial area in need of
additional research.43 Furthermore, since few authors
reported the specific ground-reaction forces associated
with the intervention employed, we were limited to
developing a somewhat arbitrary classification system.

Despite the fact that meta-analysis is a quantitative
approach for reviewing a body of literature,
subjective decisions still have to be made. For example,
in this investigation, we chose to include
unpublished studies (dissertations) in our analysis.
While the inclusion of unpublished studies in scientific
overviews is controversial, we believe that if appropriate
resources are available, unpublished
studies should not be systematically excluded.
Rather, they should be included and examined for
potential differences when compared to published
work. This is especially true given the fact that there
is a bias toward publishing studies that yield statistically
significant and positive results. For example,
Sterling et al.« found that approximately 96% of selected
psychology journals and 85% of selected
medical journals published studies that yielded a
statistically significant result. Support for the inclusion
of unpublished work in scientific overviews is
shared by the vast majority of meta-analysts
and methodologists; approximately 78% believe
that unpublished material should definitely or probably
be included in scientific overviews.^® Alternatively,
it may be argued that the inclusion of
unpublished work is inappropriate because it has
not gone through the peer review process and/or
that such studies were never submitted for publication
consideration because of the feeling that they
may have been flawed because of some type of
methodologic problem. However, the fact that we
found no statistically significant difference in study
quality between published and unpublished work
and found no difference in ES results when our data
were partitioned according to type of publication
led us to include this information in our analysis.

Another subjective decision we made was the inclusion
of nonrandomized, controlled trials. We believe
that it is important to include nonrandomized
trials, at least in the exploratory phase, in order to
see if they differ from randomized trials. Since our
subgroup analyses revealed no statistically significant
differences in ES between randomized and
nonrandomized trials at any of the sites assessed, we
chose to include these in our final analysis.

While it appears that aerobic, site-specific exercise
has a small but positive effect on BMD in adult
women, these results need to be interpreted with regard
to the following caveats. First, the fact that our
funnel plot analysis was suggestive of publication bias
for both lumbar spine and femur results may warrant
caution in the interpretation of our findings. We
chose to use this quasi-statistical approach because
the statistical approaches that have been developed to
date are not grounded in formal statistical theory and
make assumptions that are doubtful or indefensible.^
However, it is also important to realize that the sensitivity
of funnel plots for detecting publication bias
has not been assessed systematically.^s Second, the
very nature of meta-analysis dictates that the
meta-analysis itself inherits the limitations of the studies
included in the analysis. For example, we were unable
to perform subgroup analyses of ES changes in BMD
at the lumbar spine according to whether drugs were
taken that could enhance BMD, cigarette smoking,
diet, previous physical activity habits, and the specific
site at which BMD was assessed. In addition, insufficient
information was available to examine ES
changes in BMD at both the lumbar spine and femur
according to alcohol consumption, previous fractures,
and training modality. Furthermore, we were limited
to conducting simple rather than multiple regression analysis
because of missing data. The ability to include this
missing information may have yielded some interesting
results. However, while missing data is a common
problem in meta-analytic research, it should not preclude
one from conducting a quantitative review. In
fact, one of the reasons for conducting a meta-analysis
is to identify areas of weakness and provide directions
for future research. With the former in mind, we
believe that future studies should include, and editors
publish, complete information regarding whether any
drugs were taken that could enhance BMD, cigarette
smoking, diet, previous physical activity habits, alcohol
consumption, and previous fractures. In addition,
future studies should probably assess and report the
different ground-reaction forces associated with the
physical activity interventions they employ. We believe
that this is critical to the establishment of more
precise guidelines aimed at enhancing BMD.

In conclusion, the overall results of this study suggest
that aerobic exercise has a small but positive effect
on BMD at the lumbar spine and femur in women.

This study was funded by the United States Department of Defense,
Army Medical Research and Materiel Command Award
#17-98-1-8513.
The purpose of this study was to examine
the feasibility of acquiring individual patient
data (IPD) for a meta-analysis on the effects
of exercise on bone mineral density in
adults. We were able to obtain data from
29 (38.2%) of the 76 eligible studies. Binary
multiple logistic regression analysis revealed
a trend suggesting that authors of
studies conducted in the United States were
less likely to supply IPD when compared
with authors who conducted studies in
other countries (adjusted odds ratio, 0.324;
95% confidence interval, 0.104-1.004).
Only 19% of authors from studies conducted
in the United States vs. 52.9% of authors
from other countries provided us with
IPD. We conclude that we received a low response
rate in the acquisition of IPD for a
meta-analysis dealing with the effects of exercise
on bone mineral density in adults.
The use of summary means rather than IPD may be
more appropriate for studies of this nature.
(Am J Med Sports. 2002;4:350-354) ©2002
The use of meta-analysis is becoming increasingly
common in the exercise training and physical activity
literature. A recent MEDLINE search by one of
the authors (KSK) found that the number of citations
listed using the keywords "exercise and meta-analysis"
has increased from two between the years
1980-1985 to 121 between the years 1995-2000 (unpublished
results). To the best of our knowledge, all
meta-analyses on this topic conducted to date have
derived their results from the aggregation of summary
data provided in the studies. An alternative approach
is the retrieval of individual patient data (IPD) from
study authors.

One of the major advantages of using IPD in a
meta-analysis is the potential for increased statistical
power as well as a more thorough examination of potential
covariates.1-3 Therefore, the use of IPD may be
especially appropriate, since many meta-analyses include
a small number of studies, thus limiting the interpretation
and application of the findings.

One of the potential disadvantages with the retrieval
of IPD is the inability to obtain IPD from
studies that meet one's predefined inclusion criteria.
This results in a form of bias known as retrieval
bias.2 In addition, meta-analyses of IPD are traditionally
more expensive and labor-intensive than
meta-analyses using summary means. Consequently,
the use of summary data from individual studies
may be preferable.

We have previously published meta-analytic work
dealing with the effects of exercise on bone mineral
density (BMD) in adult humans.4-8 While our previous
work has resulted in some noteworthy findings,
these meta-analyses were based on the aggregation of
summary data from individual studies. Unfortunately,
we were limited in our ability to perform subgroup
analyses because of a lack of information. The ability
to examine potential factors associated with exercise-induced
changes in BMD is important for deriving a
better understanding of the true relationship between
exercise and BMD. Since the acquisition of IPD could
lead to a more accurate determination of the role of
exercise on BMD, we sought such data from study
authors. Thus, the purpose of this paper is to report
on the level of success of acquiring IPD dealing with
the effects of exercise on BMD in adult humans.


ACQUISITION OF IPD DATA. The acquisition of
IPD was conducted according to the general guidelines
of Friedenreich.9 For this study, references for
IPD were derived from a database that contained
76 studies that met our previously defined meta-analytic
inclusion criteria on the effects of exercise
on BMD in adults (references available on request).
Prior to sending out our request for IPD, a cover
letter and IPD request sheet were developed, reviewed,
revised, and approved by the three authors.
We then sent, via postal mail, a copy of the
cover letter and an IPD data acquisition form to
the corresponding authors of the 76 studies. A follow-up
request, approximately 5 weeks later, was
sent to all authors who did not respond to our initial
request. If the corresponding author referred us
to one of the coauthors, contact was made with
that author in an attempt to retrieve IPD. The first
request contained no deadline date for the receipt
of IPD. However, the second request included a
deadline date of approximately 4 weeks from the
date of mailing for the receipt of IPD. This deadline
was extended for those authors who contacted
us to request additional time to provide us with
IPD. Some individual patient data were already
available from five of the original studies in our
database (i.e., from the published tables). However,
requests were also sent to the corresponding authors
of these studies in the event that additional
IPD data might be provided. All authors who supplied
IPD were mailed a check for US $40 to help
cover incurred costs. We were limited to this
amount of money because of budget limitations.
Prior to the start of this study, approval was obtained
from the Institutional Review Board at Massachusetts
General Hospital.

STATISTICAL ANALYSIS. Descriptive statistics (frequencies,
percentages, ranges, means, and standard
deviations) were used to report overall results.
Binary multiple logistic regression was used to examine
potential predictors for whether IPD were finally
sent to us or not. Predictors in the model
included gender of author contacted, source of
publication (journal vs. other), country in which
the study was conducted (United States vs. other),
and year of publication. The likelihood ratio statistic
and Hosmer and Lemeshow test were used to
identify whether the model adequately fit the
data. The Nagelkerke R-squared statistic was used
to identify the amount of variance accounted for
by the predictor variables.10 The Nagelkerke R-squared
statistic is an adjusted version of the Cox
and Snell R-squared. This adjustment was necessary
because the Cox and Snell R-squared statistic
has a value less than 1 even for a perfect model.
Significance of regression coefficients for individual
predictor variables was examined using the
Wald statistic. In addition, odds ratios and 95%
confidence intervals (CI), adjusted for other variables
in the model, were used to examine the significance
of individual predictor variables.
Comparison of models with and without interactions
was examined using the G test, which compares
the log likelihoods between two models.
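
The relationship between the Cox and Snell and Nagelkerke statistics described above can be sketched as follows. This is an illustrative sketch using the standard textbook definitions; the log-likelihood values and sample size are hypothetical and are not taken from the study.

```python
from math import exp

def cox_snell_r2(ll_null, ll_model, n):
    """Cox and Snell R-squared from null and fitted model log-likelihoods."""
    return 1.0 - exp(2.0 * (ll_null - ll_model) / n)

def nagelkerke_r2(ll_null, ll_model, n):
    """Cox and Snell R-squared rescaled by its maximum attainable value."""
    max_cox_snell = 1.0 - exp(2.0 * ll_null / n)
    return cox_snell_r2(ll_null, ll_model, n) / max_cox_snell

# Hypothetical log-likelihoods (assumptions for illustration only)
n_obs = 76
ll_null, ll_fitted = -50.0, -43.0

print(f"Cox and Snell = {cox_snell_r2(ll_null, ll_fitted, n_obs):.3f}")
print(f"Nagelkerke    = {nagelkerke_r2(ll_null, ll_fitted, n_obs):.3f}")

# A perfect model has a log-likelihood of 0: Cox and Snell stays below 1,
# while the Nagelkerke version reaches 1.
print(f"perfect model: {cox_snell_r2(ll_null, 0.0, n_obs):.3f} "
      f"vs {nagelkerke_r2(ll_null, 0.0, n_obs):.3f}")
```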

For comparative purposes, BMD results were calculated
using the standardized difference effect size
(ES) estimated from the summary data reported in
the studies and corrected for small sample bias.11
We were unable to use the original metric because
of missing data. The standardized difference ES was
calculated by taking the difference in BMD between
the exercise and control groups and dividing by the
pooled standard deviation of the exercise and control
groups. For studies that did not supply these
data, the ES was calculated from other reported statistics
by previously developed methods.12 In general,
an ES of 0.20 is considered a small effect, 0.50 a
moderate effect, and 0.80 a large effect.13 An ES of
0.20, for example, means that the exercise group differed
from the control group by only two tenths of
a standard deviation in favor of the exercise group.
We then compared ES differences between those
studies that did and did not provide IPD, using an
analysis of variance-like random effects model developed
for meta-analytic research.14 This was accomplished
by examining the between (Qb) and
within (Qw) group differences for the ESs and their
variances from each group. In addition, for those
studies that supplied IPD, this approach was used to
examine whether any statistically significant or
clinically important differences existed between calculations
of ESs from IPD and summary data reported
in the studies.
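
The standardized difference ES described above can be sketched as follows. This is a minimal illustration: the small-sample correction factor shown is the common approximation 1 - 3/(4N - 9), which is an assumption about the method cited, and the group means, standard deviations, and sample sizes are hypothetical.

```python
from math import sqrt

def pooled_sd(sd_ex, n_ex, sd_con, n_con):
    """Pooled standard deviation of the exercise and control groups."""
    numerator = (n_ex - 1) * sd_ex ** 2 + (n_con - 1) * sd_con ** 2
    return sqrt(numerator / (n_ex + n_con - 2))

def standardized_es(mean_ex, sd_ex, n_ex, mean_con, sd_con, n_con):
    """Standardized difference ES with an approximate small-sample correction."""
    d = (mean_ex - mean_con) / pooled_sd(sd_ex, n_ex, sd_con, n_con)
    correction = 1.0 - 3.0 / (4.0 * (n_ex + n_con) - 9.0)  # assumed correction
    return correction * d

# Hypothetical changes in BMD (g/cm^2), NOT values from any included study
es = standardized_es(mean_ex=0.010, sd_ex=0.030, n_ex=20,
                     mean_con=-0.005, sd_con=0.030, n_con=20)
print(f"ES = {es:.3f}")
```

With these hypothetical inputs the uncorrected difference is half a pooled standard deviation, and the correction shrinks it slightly, which is the intended effect for small samples.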

The alpha level for a type I error was set at
p<0.05. Ninety-five percent CIs that did not cross
zero were considered statistically significant.
Trends were defined as values >0.05 but <0.10.15


DESCRIPTION OF RESPONSES. Of the 76 requests
mailed out, 41 (53.9%) authors responded, 33
(43.4%) did not respond at all, and two (2.6%) were
returned to us because of undeliverable/invalid addresses.
The reasons given by those authors who responded
to our request but never supplied IPD data
are shown in Table I. Of the 41 who did respond, 26
of the 74 total, or 35.1%, provided us with IPD data.
Of the 26 authors who provided IPD data, 22
(84.6%) sent their data as an attachment via electronic
mail (our suggested preference), while two
each sent data via either postal mail (7.7%) or facsimile
(7.7%). The time taken from the date initial
letters of request were mailed to the date that data
were received ranged from 14-89 days (X±SD =
50±23 days). Of the 22 authors who provided IPD
via electronic mail, 15 (68.2%) included their data
as a Microsoft Excel attachment (our suggested
preference), while the remaining seven (31.8%)
provided data as an SPSS file. Individual patient
data provided from one author (for one study)
could not be used because of missing data for BMD
and our inability to contact this author at follow-up.
Individual patient data from another author
(for one study) was also excluded because it was a
subset of data from another study already included
in our database. Thus, we received usable IPD data
for 24 of 76 studies (31.6%) for which data were
requested. In addition, we already had in our possession
IPD data from a total of five (6.6%) other
studies. This left us with 29 studies (38.2%) for future
IPD level analysis.
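
The response counts above can be tallied with a short script, which also makes the different denominators explicit (76 requests mailed vs. 74 delivered). The counts are those reported in the text; the variable names are illustrative only.

```python
def pct(part, whole):
    """Percentage rounded to one decimal place, as reported in the text."""
    return round(100.0 * part / whole, 1)

requests_mailed = 76
responded, no_response, undeliverable = 41, 33, 2
delivered = requests_mailed - undeliverable      # 74 requests reached an author
supplied_ipd = 26
unusable = 2      # one with missing BMD data, one a subset of another study
usable_ipd = supplied_ipd - unusable             # 24
already_held = 5  # IPD already available from published tables
total_for_analysis = usable_ipd + already_held   # 29

rates = {
    "responded": pct(responded, requests_mailed),
    "no response": pct(no_response, requests_mailed),
    "undeliverable": pct(undeliverable, requests_mailed),
    "supplied IPD (of delivered)": pct(supplied_ipd, delivered),
    "usable IPD": pct(usable_ipd, requests_mailed),
    "already held": pct(already_held, requests_mailed),
    "available for IPD analysis": pct(total_for_analysis, requests_mailed),
}
for label, value in rates.items():
    print(f"{label}: {value}%")
```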

LOGISTIC REGRESSION ANALYSIS. The results of
our binary multiple logistic regression analysis are
shown in Table II. Approximately 21% of the variance
was accounted for by the predictor variables
(R²adj=0.207). Both the likelihood ratio statistic
(χ²=13.046; p=0.017) and Hosmer and Lemeshow
test (χ²=4.660; p=0.793) demonstrated that the
model adequately fit the data. There was a trend
for country where the study was conducted (United
States vs. other) to be a predictor of whether or
not IPD were provided, suggesting that authors of
studies conducted in the United States were less
likely to supply IPD when compared with authors
who conducted studies in other countries. Only
19.0% of authors from the United States, vs. 52.9%
of authors from other countries, provided us with
IPD. No other variables were significant predictors
for whether IPD would be provided.

Since there was a statistically significant association
between country and year of publication
(r=0.330; p=0.004), we compared our original
model with a second model that included the interaction
between country and year of publication.
No statistically significant difference was found between
the two models (G=0.464; p=0.496).
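
As a rough check on how an adjusted odds ratio and its 95% CI follow from a logistic regression coefficient, the sketch below applies the usual Wald formulas to the coefficient and standard error reported for country in Table II (B=-1.128, SE=0.577). The function name and the normal approximation for the p value are assumptions for illustration; the results agree with the published values to within rounding of B and SE.

```python
from math import erfc, exp, sqrt

def wald_summary(b, se, z_crit=1.96):
    """Odds ratio, Wald 95% CI, and two-sided normal p value for one coefficient."""
    odds_ratio = exp(b)
    ci = (exp(b - z_crit * se), exp(b + z_crit * se))
    wald_z = b / se
    p_value = erfc(abs(wald_z) / sqrt(2.0))  # two-sided p from the normal curve
    return odds_ratio, ci, p_value

# Coefficient and standard error for country as reported in Table II
odds_ratio, ci, p_value = wald_summary(b=-1.128, se=0.577)
print(f"odds ratio = {odds_ratio:.3f}")
print(f"95% CI = {ci[0]:.3f} to {ci[1]:.3f}")
print(f"p = {p_value:.3f}")
```

Because the upper confidence limit sits almost exactly at 1, the result is reported as a trend rather than a statistically significant effect.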

ES COMPARISONS. No statistically significant or clinically
important differences were found in BMD between
those studies that provided IPD vs. those that
did not (IPD provided: X±SD=0.134±0.364; 95%
CI=0.069-0.198; IPD not provided: X±SD=0.195±0.387;
95% CI=0.143-0.247; Qb=2.126; p=0.145). In addition,
for those studies that supplied IPD, no statistically
significant or clinically important differences were
found between calculations of ES from IPD and summary
data reported in the studies (calculations from
IPD: X±SD=0.179±0.413; 95% CI=0.106-0.252; calculations
from summary data: X±SD=0.134±0.364; 95%
CI=0.069-0.198; Qb=0.664; p=0.415).
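
The p values attached to the between-group Q statistics above can be approximated by referring Qb to a chi-square distribution. The sketch below assumes one degree of freedom (two groups compared), which is the usual convention for this kind of analysis of variance-like model but is not stated explicitly in the text.

```python
from math import erfc, sqrt

def chi2_p_value_df1(q):
    """Upper-tail p value of a chi-square statistic with 1 degree of freedom."""
    return erfc(sqrt(q / 2.0))

# Qb values reported in the text for the two ES comparisons
p_ipd_vs_no_ipd = chi2_p_value_df1(2.126)
p_ipd_vs_summary = chi2_p_value_df1(0.664)
print(f"Qb=2.126 -> p={p_ipd_vs_no_ipd:.3f}")
print(f"Qb=0.664 -> p={p_ipd_vs_summary:.3f}")
```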


While the acquisition of IPD for meta-analytic purposes
can lead to increased statistical power and a
more thorough examination of potential covariates,
the results of our investigation suggest that obtaining
such data from authors of intervention studies
dealing with the effects of exercise training on BMD
in adults is difficult (31.6% of authors contacted provided
IPD). This, coupled with the increased costs associated
with the retrieval of IPD, as well as the fact
that we found no differences in BMD between studies
that provided IPD vs. those that did not, suggests
that the use of summary data from the actual studies
may be more appropriate for examining the effects


Table I. Responses of Investigators Who Responded to the Authors' Request but Never Supplied IPD

No. of Authors   Results/Responses
6                Data no longer available
2                Data no longer available because of a change in computer systems
3                Corresponding author did not have data; referred us to another author who did not
                 respond and/or did not send data
1                Expressed an interest in providing data but never provided such
1                Did not have time to track down data
1                Not willing to supply data until published in a refereed journal (original source was a
                 dissertation)
1                Not willing to supply data because meta-analysis is inappropriate for exercise and bone
                 mineral density studies
Variable    B          SE        df   Significance   Exp(B)   95% CI
Constant    -275.443   190.997   1    0.149          0.000    NA
Country     -1.128     0.577     1    0.051*         0.324    0.104-1.004*
Gender      0.246      0.544     1    0.652          1.279    0.440-3.716
Source      0.614      1.179     1    0.602          1.848    0.183-18.627
Year        0.138      0.096     1    0.146          1.150    0.951-1.385

B=regression coefficients for the logistic regression; SE=standard error of the regression coefficients; df=degrees of
freedom; Exp(B)=odds ratio, adjusted for independent variables; 95% CI=95% confidence interval for the odds
ratio; *trend for statistical significance


of exercise training on BMD in adults. In addition,
the inability to acquire IPD results in greater information
bias, thus limiting the interpretation of findings
from such studies. Thus, the use of summary
means rather than IPD when conducting a meta-analysis on
the effects of exercise training on BMD in humans
should result in a more accurate as well as cost- and
time-effective investigation. This is important because
it would allow those with limited resources to
conduct studies of this nature. Furthermore, given
the proliferation of information in the health care
field, a continued need for meta-analysis will exist.

We are not aware of anyone else who has attempted
to retrieve IPD for an exercise-related meta-analysis.
While the retrieval of IPD for meta-analyses may
be problematic across all fields, including exercise, our
results suggest that it may be especially problematic
for those individuals interested in conducting IPD
meta-analyses of exercise and bone studies. For example,
Arnott et al.16 were able to retrieve IPD from five
of seven trials (71.4%) dealing with the effects of preoperative
radiation therapy in esophageal carcinoma.
Another meta-analysis reported the retrieval of IPD
from 39 of 63 studies (61.9%) that met their inclusion
criteria on the topic of breast cancer and hormone replacement
therapy.17 This compares to approximately
32% in our study.

One of the surprising findings of this study was the
trend for more authors from studies conducted in
countries other than the United States to provide us
with IPD. While purely speculative, it may be that authors
of studies conducted in the United States were
less likely to provide us with IPD because they did not
want to take the time to retrieve such information.
This may be especially true given the small amount of
money (US $40) we provided them for the retrieval of
such. Unfortunately, we were not able to provide
more money because of budget limitations. Alternatively,
$40 to researchers in some foreign countries
may represent a significant amount of money, thus
resulting in a greater willingness to supply IPD.

Another possible reason for the low response rate
from studies conducted in the United States may have
to do with the investigators' concerns about protecting
their data because of the potential misuse of such.
For example, the strict guidelines that are enforced by
the vast majority of university and hospital institutional
review boards in the United States surrounding
issues such as subject confidentiality may have precluded
authors from supplying us with IPD. However,
we believe that concerns about approbation from institutional
review boards should not be an issue, as researchers
should have data storage systems that
protect the confidentiality of patients. Consequently,
the sharing of IPD should not be a problem. Researchers
who informed us that the data are "no
longer available" were troubling in that the failure to
sustain IPD in a manner that allows for verification of
an analysis might be considered an ethical issue. Alternatively,
this may be an issue of nothing more
than selfishness on the part of some investigators.
Since cooperation and trust are part of the foundation
of science, we believe that any acts of selfishness on
the part of investigators should be discouraged.

Since our investigation was limited to studies dealing
with the effects of exercise on BMD, it may be inappropriate
to generalize our results to other exercise
meta-analyses. This, coupled with the fact that we are
not aware of any other work in the meta-analytic field
that has focused on predictors for retrieval of IPD,
would lead us to suggest that future research in the
meta-analytic field in general, and the exercise and
meta-analysis field in particular, focus on this area.
This may be especially important, since only 21% of
the variance was accounted for in our logistic regression
model. Thus, it appears that there may be other
unknown factors, or combinations of factors, surrounding
the retrieval of IPD.

In conclusion, the results of our study suggest a
low response rate in acquiring IPD for a meta-analysis
dealing with the effects of exercise training
on BMD in adults, and show that success
appears to be greater when IPD are requested for
studies conducted in countries other than the
United States. Given the relatively low response
rate, and thus increased bias, the use of summary
data may be more appropriate for examining the
effects of exercise training on BMD in adults.
Background. Low bone mineral density (BMD) at the lumbar spine is a major public health problem among postmenopausal
women. We conducted a meta-analysis of individual patient data (IPD) to examine the effects of exercise on
lumbar spine BMD in postmenopausal women.

Methods. IPD were requested from a previously developed database of summary means from randomized and nonrandomized
trials dealing with the effects of exercise on BMD. Two-way analysis of variance tests with pairwise comparisons
(p ≤ .05) and 95% confidence intervals (CIs) were used to determine the statistical significance for changes in
lumbar spine BMD.

Results. Across 13 trials that included 699 subjects (355 exercise, 344 control), a statistically significant interaction
was found between test and group (F = 15.232, p = .000). Pairwise comparisons (Bonferroni t tests) revealed a statistically
significant increase in final minus initial BMD for the exercise group (X̄ ± SD = 0.005 ± 0.043 g/cm², t = 2.46, p =
.014, 95% CI = 0.001 to 0.009) and a statistically significant decrease in final minus initial BMD for the control group (X̄ ±
SD = -0.007 ± 0.045 g/cm², t = -3.051, p = .002, 95% CI = -0.012 to -0.002). Changes were equivalent to an approximate
2% benefit in lumbar spine BMD (exercise, +1%; control, -1%).

Conclusions.  The  results  of  this  IPD  meta-analysis  suggest  that  exercise  helps  to  improve  and  maintain  lumbar  spine 
BMD  in  postmenopausal  women. 


It has been estimated that approximately 26.2 million
postmenopausal women have either osteoporosis or osteopenia
(1). As a result of having osteoporosis or osteopenia,
a person is at an increased risk for fracture, particularly
at the vertebrae, hip, and distal forearm (2). Of these three
sites, fractures of the vertebrae, which represent approximately
56% of all fractures, are the most common, with an
estimated 700,000 per year (2). The health-care costs associated
with vertebral fractures were estimated to be approximately
$746 million in 1995 and are expected to increase
substantially in the future (3).

One of the potential interventions for increasing and/or
maintaining vertebral bone mineral density (BMD) in postmenopausal
women is exercise, a low-cost, nonpharmacologic
intervention that is available to most individuals. We
have recently conducted meta-analytic work in which we
reported improvements in lumbar spine BMD because of
exercise in postmenopausal women (4,5). This work was
based on the most commonly used approach for conducting
meta-analytic work, that is, the abstraction of summary
means from studies meeting specified inclusion criteria.
However, the use of individual patient data (IPD) versus
summary means from eligible studies represents the most
comprehensive approach for conducting meta-analytic work,
including the potential for increased statistical power as
well as a more thorough examination of potential covariates
(6-8). Given the health-care consequences of low BMD at
the lumbar spine, the possible benefit of exercise for improving
and/or maintaining lumbar spine BMD, and the potential
for a meta-analysis of IPD to provide more thorough
information regarding the effect of exercise on lumbar spine
BMD, we sought to examine the effects of exercise on lumbar
spine BMD in postmenopausal women by conducting a
meta-analysis using IPD.

Methods 

Data  Sources 

From a previously developed meta-analytic database that
included the summary means from 76 studies dealing with
the effects of exercise on BMD, we sought to obtain IPD.
Briefly, IPD were requested by sending a cover letter and
data request sheet to authors via postal mail. For those who
did not respond to our initial request, a follow-up letter was
sent via postal mail approximately 5 weeks later.

Study  Selection 

From the database of 76 studies, we included studies that
met the following criteria: (i) randomized and nonrandomized
trials that included a comparative control (nonexercise)
group, (ii) exercise lasting at least 16 weeks, (iii) postmenopausal
women only, (iv) journal articles, dissertations, and
master's theses published in the English-language literature,
(v) studies published between January 1966 and December
1998, (vi) BMD (relative value of bone mineral per measured
bone area) assessed at the lumbar spine, and (vii) ability
to obtain IPD from authors. Despite the fact that methods
to assess BMD (dual-photon absorptiometry [DPA], dual-energy
x-ray absorptiometry [DEXA]) have only been
widely available since the 1980s, we searched back to 1966
to ensure that there was no comparative technology that we
might have missed. We did not include studies from
non-English-language journals because of the potential for error
in the translation and interpretation of findings. If more than
one study included the same subjects, for example, a dissertation
and refereed journal article, we retrieved and referenced
both studies to extract the maximum amount of information
but only included this as one data set.

Data  Abstraction 

All data were abstracted on a coding sheet that could hold
up to 91 pieces of information from each study. All data
were coded and verified for accuracy and consistency by
George A. Kelley. Blinding of the coder to the identity and
institutional affiliation of the authors as well as study results
was not performed because it has been shown that these
procedures have neither a statistically significant nor a clinically
important effect on the results (9). The major categories
coded included study, subject, BMD assessment, and
training program characteristics as well as primary and secondary
outcomes.

Statistical Analysis

Initial subject characteristics. Potential differences between
initial subject characteristics for exercise and control
groups were examined using independent t tests and 95%
confidence intervals (CIs) for continuous variables and 2 ×
2 chi-square tests for categorical variables.
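As an illustration of the independent t test described above, the following minimal sketch works from summary statistics only. It assumes a pooled-variance (Student) test and a normal critical value of 1.96 for the interval; neither choice is stated in the text, and the function name is hypothetical. The inputs are the years-postmenopausal values reported in Table 2, so the output should land close to, but not exactly on, the reported t of -3.46, because the published values were computed from the IPD rather than from rounded summaries.

```python
import math

def pooled_t_test(mean1, sd1, n1, mean2, sd2, n2, z_crit=1.96):
    """Pooled-variance t statistic and approximate 95% CI for mean1 - mean2.

    Assumption: equal variances and a normal critical value (1.96).
    """
    diff = mean1 - mean2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    t = diff / se
    return t, (diff - z_crit * se, diff + z_crit * se)

# Years postmenopausal, exercise vs control (summary values from Table 2)
t_stat, ci = pooled_t_test(13.0, 9.9, 156, 17.3, 11.8, 147)
print(f"t = {t_stat:.2f}, 95% CI = {ci[0]:.2f} to {ci[1]:.2f}")
```

Because the interval excludes zero, the sketch agrees with the paper's conclusion that the groups differed at baseline on this variable.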

Primary outcomes. Initial and final values for lumbar
spine BMD between exercise and control groups were examined
by using a two-way analysis of variance (ANOVA)
test with repeated measures on one factor (test). Because
this was an unbalanced design, a General Linear Model was
used. Pairwise comparison tests (Bonferroni t tests) were
used to identify the specific location of the observed interaction
between test (final vs initial) and group (exercise vs
control). To examine for outliers, ANOVAs were performed
with each study deleted from the model once. Because
of missing data, we were unable to include potential
covariates in the ANOVA model. Consequently, we used
Pearson product-moment correlations to examine for potential
associations between changes in BMD and age, height,
body weight, years postmenopausal, cigarette smoking, alcohol
consumption, calcium and vitamin D intake, compliance
(percentage of exercise sessions attended), length of
training (weeks), type of BMD assessment (DEXA, DPA),
and study design (randomized vs nonrandomized controlled
trial). We were unable to partition the data according to the
different types of exercise because of the various interventions
employed.

Because of the inability to retrieve IPD from all eligible
studies, we also examined whether our results differed between
studies according to the availability of IPD. To include
all eligible studies in the analysis, we used the standardized
difference effect size (ES) calculated from the
summary data reported in the studies and corrected for
small sample bias (10). In general, an ES of 0.20 is considered
a small effect, 0.50 a moderate effect, and 0.80 a large
effect (11). An ES of 0.80, for example, means that the exercise
group differed from the control group by eight-tenths
of a standard deviation in favor of the exercise group. We
then compared ES differences between those studies in
which IPD were provided versus those in which they were
not using an ANOVA-like random effects model developed
for meta-analytic research (10). This was accomplished by
examining the between (Qb) and within (Qw) group differences
for the ESs and their variances from each group.
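The standardized difference ES with a small-sample correction can be sketched as follows. This is a minimal illustration that assumes the common Hedges form of the correction, J = 1 - 3/(4df - 1), and the usual large-sample variance approximation; the text does not spell out the exact formula used, and the function name and the example trial values are hypothetical. The random effects partition into Qb and Qw is not shown.

```python
import math

def hedges_g(mean_e, mean_c, sd_e, sd_c, n_e, n_c):
    """Standardized mean difference with a small-sample bias correction.

    Assumption: Hedges' correction factor J = 1 - 3 / (4 * df - 1).
    Returns the corrected effect size and its approximate variance.
    """
    df = n_e + n_c - 2
    sd_pooled = math.sqrt(((n_e - 1) * sd_e ** 2 + (n_c - 1) * sd_c ** 2) / df)
    d = (mean_e - mean_c) / sd_pooled        # uncorrected standardized difference
    g = (1 - 3 / (4 * df - 1)) * d           # small-sample correction
    var_g = (n_e + n_c) / (n_e * n_c) + g ** 2 / (2 * (n_e + n_c))
    return g, var_g

# Hypothetical trial: BMD change of +0.010 (exercise) vs -0.008 (control) g/cm2,
# SD of 0.040 in each group, 20 subjects per group.
g, var_g = hedges_g(0.010, -0.008, 0.040, 0.040, 20, 20)
print(f"ES = {g:.3f}, variance = {var_g:.3f}")
```

The correction shrinks the uncorrected value (0.45 in this example) slightly toward zero, and the shrinkage becomes negligible as the group sizes grow.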

Secondary outcomes. Secondary outcomes (body weight,
calcium intake, and vitamin D intake) were analyzed using
the same ANOVA procedures that were used to evaluate
changes in lumbar spine BMD. We used independent t tests
to analyze initial differences between exercise and control
groups for these variables because more data were available
for initial values versus final values and we wanted to capture
as much data as possible in our analyses.

Descriptive Statistics and Alpha Level

Means and standard deviations (X̄ ± SD) were used to
describe continuous variables, whereas frequencies and percentages
were used for categorical variables. The alpha
level for statistical significance was set at p ≤ .05. Ninety-five
percent CIs that did not cross 0.00 were also considered
statistically significant.
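The confidence interval criterion can be illustrated with the change scores reported in the abstract. The sketch below is an approximation under stated assumptions: it uses a normal critical value of 1.96 and the rounded summary values (mean change, SD, and n), whereas the published intervals were computed from the IPD. The helper names are hypothetical.

```python
import math

def mean_ci(mean, sd, n, z_crit=1.96):
    """Approximate 95% CI for a mean, using a normal critical value."""
    half_width = z_crit * sd / math.sqrt(n)
    return mean - half_width, mean + half_width

def excludes_zero(ci):
    """True when the interval lies entirely above or entirely below 0.00."""
    lo, hi = ci
    return lo > 0 or hi < 0

# Final minus initial lumbar spine BMD (g/cm2), values from the abstract
exercise_ci = mean_ci(0.005, 0.043, 355)
control_ci = mean_ci(-0.007, 0.045, 344)

for label, ci in (("exercise", exercise_ci), ("control", control_ci)):
    print(f"{label}: {ci[0]:.4f} to {ci[1]:.4f}, significant = {excludes_zero(ci)}")
```

Both intervals exclude zero, which matches the reported finding of a significant increase in the exercise group and a significant decrease in the control group.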

Results 

Study Characteristics

Of the 32 studies that met our criteria for inclusion, we
were able to retrieve IPD from 13 (41%) (12-26). Note that
the number of references exceeds the number of studies because
two studies were each reported in both dissertation (17,24)
and journal (18,25) format. A description of the studies is shown
in Table 1. The 13 studies represented a total of 30 groups
(17 exercise, 13 control) and 699 subjects (355 exercise,
344 control). Seven of the trials were randomized controlled
trials, and the remaining six were nonrandomized controlled
trials. The length of the studies ranged from 24 to 104 weeks
(X̄ ± SD = 56 ± 8 weeks). Thirteen of the exercise groups
included some type of weight-bearing exercise, two appeared
to perform nonweight-bearing exercise, and the remaining
two participated in weight training. Compliance, defined as
the percentage of exercise sessions attended, averaged 75 ±
17%. Seven of the studies assessed lumbar spine BMD using
DEXA, whereas the remaining six used DPA.

Initial  Subject  Characteristics 

Initial subject characteristics for continuous and categorical
variables can be found in Tables 2 and 3, respectively.
For continuous variables, the number of years that the subjects
were postmenopausal was significantly greater in the
control versus exercise groups, whereas calcium intake was
greater in the exercise versus control groups. No statistically
significant differences between exercise and control groups
were observed for any other continuous or categorical variables.

Table 1. Characteristics of Bone Mineral Density Studies (g/cm²) in Which IPD Were Provided for Postmenopausal Women at the
Lumbar Spine

Study: Bloomfield and colleagues (12)
Design/Subjects: CT that included 18 postmenopausal women assigned to either an exercise (n = 7; age = 62.1 ± 2.1 years)
or control (n = 11; age = 60.0 ± 9.4 years) group.
Exercise Intervention(s): 32 weeks of training performed 3× per week for 50 minutes per session (15-minute warm-up,
30 minutes of stationary cycling, 5-minute cool-down) at 60-80% of maximal heart rate.
BMD Assessment: DPA (Lunar DP3, Lunar Radiation, Madison, WI) at L1-L4.

Study: Bravo and colleagues (13)
Design/Subjects: RCT that included 106 women assigned to either an exercise (n = 44; age = 59.8 ± 5.9 years) or control
(n = 62; age = 60 ± 6.3 years) group.
Exercise Intervention(s): 52 weeks of training performed 3× per week for 60-65 minutes per session. Exercise sessions
consisted of a 10-minute warm-up, 25 minutes of rapid walking replaced with aerobic dance 1× per week, and 15 minutes
of bench stepping at 60-70% of MHRR. This was followed by 10-15 minutes of resistance exercise.
BMD Assessment: DEXA (Lunar DPX, Lunar Radiation) at [illegible]-L4.

Study: Brooke-Wavell and colleagues (14)
Design/Subjects: RCT of 76 postmenopausal women assigned to either an exercise (n = 37; age = 65.0 ± 2.8 years) or
control (n = 39; age = 64.2 ± 3.1 years) group.
Exercise Intervention(s): 52 weeks of training that consisted of self-monitored walking 3.5 times per week for 14.8
minutes per day for the first 12 weeks, followed by 20.4 minutes per day of walking, 4.8 days per week, for the remainder
of the study.
BMD Assessment: DEXA (Lunar DPX, Lunar Radiation) at L2-[illegible].

Study: Caplan and colleagues (15)
Design/Subjects: CT of 30 postmenopausal women assigned to either an exercise (n = 19; age = 66.4 ± 5.0 years) or
control (n = 11; age = 65.4 ± 4.9 years) group.
Exercise Intervention(s): 104 weeks of aerobic weight-bearing exercise performed 2× per week for 60 minutes (warm-up,
20-25 minutes of low-impact aerobic exercise, 10 minutes of ball games for improved hand-eye coordination followed by
work on floor mats for strength and flexibility, 10 minutes of relaxation). Subjects were also asked to exercise on their
own 1× per week so that the pulse would be elevated for at least 20-30 minutes.
BMD Assessment: DPA (Lunar DPA, Lunar Radiation).

Study: Ebrahim and colleagues (16)
Design/Subjects: RCT of 92 postmenopausal women assigned to either an exercise (n = 47; age = 66.4 ± 7.9 years) or
control (n = 45; age = 68.1 ± 7.8 years) group.
Exercise Intervention(s): 104 weeks of walking 3× per week for 40 minutes per session.
BMD Assessment: DEXA (Lunar DPX, Lunar Radiation).

Study: Grove (17); Grove and Londeree (18)
Design/Subjects: RCT that included 15 postmenopausal women assigned to either a low-impact exercise group (n = 5;
age = 56.6 ± 43.3 years), high-impact exercise group (n = 5; age = 54.0 ± 1.9 years), or control group (n = 5;
age = 56.0 ± 4.5 years).
Exercise Intervention(s): 52 weeks of training performed 3× per week for approximately 60 minutes per session (15-20
minute warm-up, 20 minutes of either low- or high-impact exercise, 15-minute cool-down). Low-impact activities were
considered those that produced forces less than 1.5× body weight, high impact ≥2.0× body weight.
BMD Assessment: DPA (Lunar DP3, Lunar Radiation) at L2-L4.

Study: Iwamoto and colleagues (19)
Design/Subjects: CT that included 35 postmenopausal women assigned to either an exercise (n = 15; age = 64.8 ± 6.1
years) or control (n = 20; age = 64.8 ± 5.7 years) group.
Exercise Intervention(s): 52 weeks of outdoor walking (7 days per week) and gymnastic training (at least 5 days per
week).
BMD Assessment: DEXA (Norland XR26, Norland Medical Systems, White Plains, NY) at [illegible]-[illegible].

Study: Little (20)
Design/Subjects: CT that included 21 postmenopausal women assigned to a resistance training (n = 6; age = 59.5 ± 2.3
years), walking (n = 6; age = 52.3 ± 4.5 years), swimming (n = 5; age = 51.8 ± 5.8 years), or control (n = 4;
age = 60.8 ± 1.4 years) group.
Exercise Intervention(s): Resistance exercise consisted of 32 weeks of training with 9 exercises performed 3 times per
week for 1 set of 8-12 repetitions at 60%-80% of 1RM; Walking consisted of 32 weeks of training, 3× per week for 30-50
minutes per session (5-10-minute warm-up; walking for 20-30 minutes; 5-10-minute cool-down) at 70%-90% of maximal
heart rate; Swimming consisted of 32 weeks of training, 3× per week for 30-50 minutes per session (5-10-minute warm-up;
walking for 20-30 minutes; 5-10-minute cool-down) at 70%-90% of maximal heart rate.
BMD Assessment: DPA (Lunar, Lunar Radiation) at L2-L4.

Study: Lord and colleagues (21)
Design/Subjects: RCT that included 138 subjects assigned to either an exercise (n = 67; age = 70.8 ± 5.0 years) or
control (n = 69; age = 71.0 ± 4.9 years) group.
Exercise Intervention(s): 42 weeks of exercise performed 2× per week for approximately 60 minutes per session (5-minute
warm-up, 35-40 minutes of aerobic exercises [activities for balance, hand-eye and foot-eye coordination], strengthening
exercises, 15 minutes of stretching, and 5-10 minute cool-down).
BMD Assessment: DEXA (Lunar DPX, Lunar Radiation) at L2-L4.

Study: Martin and Notelovitz (22)
Design/Subjects: RCT that included 55 postmenopausal women assigned to a 30-minute exercise group (n = 20; age =
60.3 ± 7.8 years), 45-minute exercise group (n = 16; age = 57.8 ± 7.1 years), or control (n = 19; age = 56.7 ± 6.9
years) group.
Exercise Intervention(s): 52 weeks of treadmill exercise performed 3× per week for either 30 or 45 minutes per session
at 70-85% of maximal heart rate. Each session included a 3-5-minute warm-up and cool-down at 60% of maximal heart
rate.
BMD Assessment: DPA (Lunar DP3, Lunar Radiation) at [illegible]-[illegible].

Study: Prince and colleagues (23)
Design/Subjects: RCT that included assignment of 63 postmenopausal women to a calcium and exercise (n = 35; age =
62.4 ± 4.8 years), or calcium only (n = 28; age = 63.2 ± 4.8 years) group.
Exercise Intervention(s): 104 weeks of weight-bearing exercise performed 2× per week for approximately 60 minutes per
session. Subjects were also asked to walk another 2 hours per week at 60% of peak heart rate for their age.
BMD Assessment: DEXA (QDR-1000, Hologic, Waltham, MA) at L1-L4.

Study: Pruitt (24); Pruitt and colleagues (25)
Design/Subjects: CT that included 24 postmenopausal women assigned to either an exercise (n = 15; age = 53.6 ± 4.1
years) or control (n = 9; age = 55.6 ± 2.9 years) group.
Exercise Intervention(s): 36 weeks of strength training consisting of 13 exercises performed 3× per week at 50%-60% of
1RM for 1 set of 10-12 repetitions for the upper body and 10-15 repetitions for the lower body.
BMD Assessment: DPA (Lunar DP3, Lunar Radiation) at L2-L4.

Study: Ryan and colleagues (26)
Design/Subjects: CT that included 28 postmenopausal women assigned to either a weight loss (n = 15; age = 63.4 ± 5.7
years) or exercise + weight loss (n = 13; age = 61.3 ± 4.8 years) group.
Exercise Intervention(s): 24 weeks of aerobic exercise (treadmill jogging) performed 3× per week for up to 35 minutes
per session at 50 to >70% of VO2max. Each session included a 10-minute warm-up and cool-down period.
BMD Assessment: DEXA at L2-L4.

Notes: IPD = individual patient data; BMD = bone mineral density; CT = controlled trial; RCT = randomized controlled trial; DPA = dual-photon absorptiometry;
DEXA = dual-energy x-ray absorptiometry; MHRR = maximal heart rate reserve; RM = repetition maximum. Study by Prince also included placebo and milk
powder group but for comparison purposes, these groups were not included in our analysis. Only subjects who completed the study and for which BMD data were
available are reported in the designs/subjects section; number of subjects reported as X̄ ± SD. Bone density assessment limited to bone mineral density measures in g/cm².

Primary  Outcomes 

As can be seen in Table 4, there was an increase in lumbar
spine BMD in the exercise groups and a decrease in the
control groups. The mean difference between the two
groups was 0.013 ± 0.079 g/cm², 95% CI = 0.007 to 0.019.
These changes were equivalent to an approximate 2% benefit
in lumbar spine BMD (exercise, +1%; control, -1%).
The ANOVA results in Table 5 show a statistically significant
main effect difference between group and an interaction
between group and test. Pairwise comparison tests for
the Group × Test interaction revealed a statistically significant
increase in final versus initial BMD for the exercise
groups (t = 2.464, p = .014), a statistically significant decrease
in final versus initial BMD for control groups (t =
-3.051, p = .002), and greater initial as well as final values
for exercise groups compared to control groups (initial, t =
2.544, p = .011; final, t = 3.320, p = .001). Results were
similar when each study was deleted from the model once.
For the exercise groups, larger increases in lumbar spine
BMD were associated with assessment of BMD using
DEXA versus DPA (r = -0.126, p = .018, 95% CI =
-0.227 to -0.022). For control subjects, larger decreases in
lumbar spine BMD were associated with younger age (r =
0.170, p = .002, 95% CI = 0.064 to 0.272), taller stature (r =
-0.109, p = .048, 95% CI = -0.215 to -0.005), absence of
hormone replacement therapy (r = 0.152, p = .005, 95% CI =
0.047 to 0.254), assessment of BMD using DPA versus
DEXA (r = -0.287, p = .000, 95% CI = -0.381 to -0.187),
and nonrandomized versus randomized controlled trials (r =
0.172, p = .001, 95% CI = 0.067 to 0.273). No other statistically
significant or clinically relevant relationships were observed
for the exercise or control groups. Finally, no statistically
significant differences in lumbar spine BMD were
found when we compared the 13 studies that included IPD
(ES = 0.366 ± 0.423, 95% CI = 0.131 to 0.600) with the 19
studies that did not include IPD (ES = 0.219 ± 0.430, 95%
CI = 0.059 to 0.379; Qb = 1.184, p = .277).
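The confidence intervals attached to the correlations above are consistent with the standard Fisher z transformation, although the text does not name the method. The sketch below assumes that method and assumes n = 335 for the age correlation in control subjects (the control sample size for age in Table 2); both are assumptions for illustration, and the function name is hypothetical.

```python
import math

def pearson_r_ci(r, n, z_crit=1.96):
    """Approximate 95% CI for a Pearson correlation via the Fisher z transformation."""
    z = math.atanh(r)              # transform r to the z scale
    se = 1 / math.sqrt(n - 3)      # standard error on the z scale
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)

# Age vs change in lumbar spine BMD, control subjects (n assumed from Table 2)
lo, hi = pearson_r_ci(0.170, 335)
print(f"95% CI = {lo:.3f} to {hi:.3f}")
```

Under these assumptions the interval reproduces the reported 0.064 to 0.272, which suggests, but does not confirm, that a Fisher-type interval was used.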

Secondary  Outcomes 

No  statistically  significant  main  effects  or  interactions 
were  found  for  body  weight,  calcium  intake,  or  vitamin  D 
intake. 

Discussion 

The primary purpose of meta-analysis is to reach some
general conclusions about a body of research. The overall
results of this study suggest that exercise helps to increase
and maintain lumbar spine BMD in postmenopausal
women. This supports our previous meta-analytic work of
summary means and lumbar spine BMD (4,5), but is in contrast
to our more recent meta-analytic work using IPD in
which we found no statistically significant difference in
femoral neck BMD (27). Although these are important findings,
the clinical importance of an approximate 2% benefit,
especially as it relates to fracture risk, cannot be elucidated
at this time. However, although beyond the scope of this
study, the increased strength, balance, and ambulatory skills
that may be realized from a regular program of exercise
may also help reduce the risk of falling and suffering subsequent
fractures (28). Although we were unable to identify
specific exercise programs for optimizing lumbar spine BMD,
it would appear plausible to suggest that one adhere to the
recent National Institutes of Health Consensus Statement
that recommends participation in regular exercise, especially
resistance and high-impact activities (28).

Table 2. Initial Characteristics of Subjects for Continuous Variables

                     Exercise                Control
Variable             n     X̄ ± SD            n     X̄ ± SD            t (p)            95% CI
Age (y)              340   63.9 ± 7.4        335   64.5 ± 7.4        -0.92 (.357)     -1.64 to 0.59
Height (cm)          329   158.9 ± 6.9       327   158.2 ± 7.0       1.18 (.239)      -0.43 to 1.71
Body Weight (kg)     340   65.1 ± 12.3       330   64.1 ± 13.0       1.04 (.299)      -0.91 to 2.94
Postmenopause (y)    156   13.0 ± 9.9        147   17.3 ± 11.8       -3.46 (.001)*    -6.78 to -1.86*
Calcium (mg)         193   926.9 ± 394.0     195   834.7 ± 350.6     2.44 (.015)*     17.79 to 166.64*
Vitamin D (IUs)      55    195.9 ± 215.4     62    161.5 ± 132.4     1.05 (.294)      -30.27 to 99.13

Note: CI = confidence interval; IUs = international units.
*Statistically significant.

Table 3. Initial Characteristics of Subjects for Categorical Variables

Variable                        Exercise n (%)   Control n (%)   χ² (p)
Cigarette Smoking               25 (9.9)         33 (12.7)       1.04 (.307)
Alcohol Consumption             130 (52.2)       121 (47.5)      1.14 (.285)
Estrogen/Progesterone Use       24 (6.9)         18 (5.4)        0.65 (.419)
Previous Fractures (any site)   61 (37.4)        62 (41.6)       0.57 (.450)
Race (white)                    259 (94.1)       238 (91.2)      1.78 (.182)

Note: Results limited to studies that reported data for each variable.

Our finding that larger decreases in BMD in the control
groups were associated with younger age is not surprising
given the fact that bone loss is most rapid during the early
postmenopausal years (29). In addition, the observed association
between the absence of hormone replacement therapy
and greater decreases in lumbar spine BMD was also
not surprising because hormone replacement therapy is an
established therapeutic intervention for preserving BMD
among postmenopausal women (28). However, we can offer
no biological explanation regarding the observed association
between greater decreases in lumbar spine BMD and
taller stature, especially because shorter women are generally
believed to be more osteoporotic
than taller women. Given this currently held notion, caution
is warranted in the interpretation of this finding. Indeed, it
may be that our observed association was nothing more than
the play of chance given the large number of statistical tests
that were conducted in our study.

Meta-analysis, like any other type of review, is limited by
the availability of data and the limitations of the included
studies. Thus, in addition to making the best of the existing
data and trying to reach some overall conclusions regarding
a body of research, it is also the meta-analyst's responsibility
to identify areas of weakness to provide directions for
future research.

Table 4. Lumbar Spine BMD Results (g/cm²)

Group      n     Initial (X̄ ± SD)   Final (X̄ ± SD)    Difference (X̄ ± SD)   95% CI
Exercise   355   0.991 ± 0.221      0.996 ± 0.224     0.005 ± 0.043         0.001 to 0.009*
Control    344   0.948 ± 0.218      0.941 ± 0.218     -0.007 ± 0.045        -0.012 to -0.002*

Note: CI = confidence interval.
*Statistically significant.

Table 5. ANOVA Summary Table for Lumbar Spine BMD
(General Linear Model)

Source of Variation   df    SS       MS      F        p       Partial η²
Group                 1     0.834    0.834   8.685    .003*   0.012
Error (Group)         697   66.962   0.096
Test                  1     0.000    0.000   0.199    .656    0.000
Test × Group          1     0.001    0.001   15.232   .000*   0.021
Error (Test)          697   0.670    0.001

Note: SS = sum of squares; MS = mean square; Group = exercise vs control; Test = initial vs final.
*Significantly different, p ≤ .05.
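The effect sizes in the last column of Table 5 can be checked against the F ratios and degrees of freedom in the same table. The sketch below assumes the column reports partial eta squared, computed as F × df_effect / (F × df_effect + df_error); the column label is incomplete in the source, so this reading is an inference, and the function name is hypothetical.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F ratio and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)

# F ratios from Table 5; each effect has 1 df and each error term has 697 df
f_ratios = {"Group": 8.685, "Test": 0.199, "Test x Group": 15.232}
etas = {name: round(partial_eta_squared(f, 1, 697), 3) for name, f in f_ratios.items()}

for name, eta in etas.items():
    print(f"{name}: partial eta squared = {eta:.3f}")
```

The recovered values match the tabled 0.012, 0.000, and 0.021, so the statistically significant Test × Group interaction accounts for only about 2% of the relevant variance.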

For  example,  because  we  were  unable  to  categorize  the 
different  types  of  exercise  interventions,  we  would  suggest 
that  future  researchers  provide  a  better  description  of  their 
exercise  programs,  especially  as  it  relates  to  the  forces  em¬ 
ployed  during  the  exercise  intervention.  Consequently,  ex¬ 
ercise  programs  that  provide  optimal  benefits  to  lumbar 
spine  BMD  can  be  recommended. 

We  were  surprised  that  data  on  calcium  intake  were 
available  for  only  56%  of  the  subjects  included  in  this  anal¬ 
ysis.  Because  calcium  intake  is  important  for  maintaining 
and/or  increasing  BMD  in  humans,  it  would  seem  reason¬ 
able  to  suggest  that  data  on  calcium  intake  be  assessed  and 
reported.  In  addition,  because  vitamin  D  intake  is  also  im¬ 
portant  for  the  absorption  of  calcium  and  data  on  vitamin  D 
intake  were  available  for  only  17%  of  the  subjects  included 
in  this  analysis,  the  assessment  and  reporting  of  this  infor¬ 
mation  also  appears  warranted. 

Although white, non-Hispanic women are disproportionately affected
with osteoporosis and low bone mass, the effect on other races is also
significant. For example, the National Osteoporosis Foundation has
reported that approximately 10% of black women older than 50 years
have osteoporosis, and 29% have low bone mass. Additionally, 16% of
American-Indian and Hispanic women aged 50 and older have
osteoporosis, and 36% have low bone mass (30). Because approximately
93% of the subjects in this study were white and the responses to
exercise in relation to BMD may vary by race, it is recommended that
future studies include women from other ethnic groups.

Data on the number of years that the subjects were postmenopausal were
available for only 43% of the subjects included in this analysis.
Future research needs to include this type of information because it
may be a potential confounder in relation to exercise-induced changes
in lumbar spine BMD in postmenopausal women.

The fact that the vast majority of studies included in our
meta-analysis were published in refereed journal articles may have led
to an overestimate of the benefits of exercise on BMD at the lumbar
spine because there is a tendency for authors to submit, and editors
to publish, studies that yield statistically significant and positive
results, i.e., publication bias (10).

For both exercise and control subjects, greater decreases in lumbar
spine BMD were associated with assessment of BMD using DPA versus
DEXA. Because DEXA is generally considered to be a more valid
assessment of BMD and is currently the most common method used to
assess BMD at the lumbar spine, the results from studies using DEXA
may be more valid. The finding that greater decreases in lumbar spine
BMD were associated with nonrandomized versus randomized trials
suggests that randomized trials may yield more valid results.

Although the above-described associations are interesting, they should
be viewed with caution for the following reasons: (i) they may have
been nothing more than the play of chance given the large number of
statistical tests that were conducted, (ii) we were unable to examine
for potential interrelationships between variables because of missing
data, and (iii) the associations accounted for only a small proportion
of the total variance.

In conclusion, the results of this IPD meta-analysis suggest that
exercise improves and maintains lumbar spine BMD in postmenopausal
women.

Exercise and Bone Mineral Density at the
Femoral Neck in Postmenopausal Women:
A Meta-Analysis of Controlled Clinical Trials
Using Individual Patient Data

George A. Kelley, MD;1 Kristi S. Kelley, MEd;1 Zung Vu Tran, PhD2


The purpose of this study was to conduct a meta-analysis using
individual patient data in order to examine the efficacy of exercise
for improving bone mineral density (BMD) at the femoral neck in
postmenopausal women. Ten controlled clinical trials that included 595
subjects aged 42-92 years met the criteria for inclusion. Across all
designs and categories, there was an increase in BMD of 0.73%±5.52%
and 0.45%±6.78%, respectively, in the exercise and control subjects.
However, comparison of initial and final BMD values between exercise
and control subjects revealed no statistically significant effect of
exercise on femoral neck BMD. In addition, random-effects analyses
revealed no statistically significant within- or between-group
differences for studies in which individual patient data were
available vs. those in which such data were unavailable. The results
of this study suggest that exercise has no effect on femoral neck BMD
in postmenopausal women.

(Disease Management and Clinical Outcomes. 2002HiXX-XX)
©2002 Le Jacq Communications, Inc.

From the Clinical Investigation Program, MGH Institute of Health
Professions, Boston, MA;1 and the Department of Preventive Medicine
and Biometrics, University of Colorado Health Sciences Center,
Denver, CO2

Address for correspondence/reprint requests:
George A. Kelley, FACSM, Associate Professor, Graduate Program in
Clinical Investigation, Director, Meta-Analytic Research Group, MGH
Institute of Health Professions, 101 Merrimac Street,
Boston, MA 02444
E-mail: gakelley@bics.bwh.harvard.edu
Manuscript received October 18, 2001;
accepted November 9, 2001

WINTER 2002


Osteoporosis is a major public health problem in the United States. In
1996, it was estimated that approximately 29 million women and men
over the age of 50 years had osteoporosis or had low bone mass and
were at risk of developing the disease.1 Of these estimated 29 million
women and men, the vast majority (approximately 23 million) were
women. By the year 2035, the prevalence and risk for this disease
among women 50 years and older is estimated to increase to
approximately 35 million.1 The most devastating consequence of
osteoporosis and osteopenia is an increased risk for fracture. For
example, beginning at age 50, white women, a group at the greatest
risk for osteoporosis and osteopenia, have a 40% chance of fracturing
the spine, hip, or distal radius during their remaining lifetime.2 In
absolute terms, the number of fractures associated with osteoporosis
has been estimated to be approximately 700,000 at the vertebrae,
300,000 at the hip, and 250,000 at the distal forearm.2 In the United
States, the health care costs associated with osteoporotic fractures
exceed $13.8 billion annually.2 While the greatest number of fractures
occur at the vertebrae, the most devastating fractures in terms of
economic costs and mortality are fractures of the hip, specifically,
the proximal femur.2 For example, each hip fracture in the United
States has been estimated to cost approximately $32,000 in total
medical expenditures.3 In addition, the 1-year mortality rate
following a hip fracture is approximately 20%.4 Furthermore, while
difficult to assess, limited ambulatory skills, as well as fears about
additional fractures, may affect the quality of life of individuals
who have experienced a fracture.1,2 Finally, a woman's risk of a hip
fracture is equivalent to her combined risk of developing breast,
uterine, and ovarian cancer.2

A recent consensus statement from the National Institutes of Health
Consensus Development Panel on Osteoporosis, Prevention, Diagnosis,
and Therapy concluded that during the later years of life, exercise,
in the presence of adequate calcium and vitamin D intake, has a modest
effect on slowing the loss of bone mineral density (BMD).4 Our
previous meta-analytic research, using summary means from eligible
studies, revealed improvement in BMD at the femur as a result of
site-specific aerobic exercise, as well as progressive resistance
exercise, in postmenopausal women.5,6 Unfortunately, because of the
small number of summary means from the studies, different assessment
sites at the femur (femoral neck, Ward's triangle, trochanter,
intertrochanter) had to be combined into one summary measure.
Consequently, we did not examine the effects of exercise on BMD at the
femoral neck, the most common site assessed at the femur. An
alternative approach for dealing with this issue is to conduct a
meta-analysis using individual patient data (IPD) rather than summary
means from the studies. To the best of our knowledge, no one has
attempted to use IPD for the purpose of conducting a meta-analysis on
the effects of exercise on BMD at the femoral neck in postmenopausal
women.

Given the health care consequences of low BMD at the femoral neck in
postmenopausal women, as well as the potential benefit of exercise, a
nonpharmacologic intervention that is available to most people, a need
exists to examine the effects of exercise in this setting. Therefore,
we conducted a meta-analysis using IPD in order to examine the
efficacy of exercise in improving BMD at the femoral neck in this
population of women.

METHODS
Data Sources

Studies for this investigation were extracted from a larger exercise
and bone-density database in which IPD were available. Briefly, IPD
were obtained by sending a cover letter and data request form to
authors of reported studies via postal mail. Authors who did not
respond received a follow-up request approximately 5 weeks after the
initial mailing.

Study Selection

Inclusion criteria for this study were as follows: 1) randomized and
nonrandomized trials that included a comparative non-exercise control
group; 2) site-specific loading exercise lasting a minimum of 16
weeks; 3) postmenopausal women only; 4) journal articles,
dissertations, and master's theses published in the English language
literature; 5) studies published between January 1966 and December
1998; 6) BMD (relative value of bone mineral per measured bone area)
assessed at the femoral neck; and 7) ability to obtain IPD from
authors. Exclusion criteria for this study included the following: 1)
studies in men or premenopausal women; 2) observational studies; 3)
review articles; 4) case reports; 5) comments; 6) letters; 7) nonhuman
studies; 8) foreign-language articles; and 9) abstracts from
conference meetings. We did not include foreign-language articles
because of the potential for error in the translation and
interpretation of findings. We limited our analysis to BMD at the
femoral neck because this is the most common site assessed and because
of missing data for the other sites (Ward's triangle, trochanter,
intertrochanter). For studies that were multiply published, i.e., as
both a dissertation and a refereed journal article, we examined and
referenced both in order to derive the maximum amount of information
possible, but included this information as only one set of data.

Data Abstraction

A coding sheet that could hold 94 pieces of information was developed
and used in this investigation. In addition, coding instructions that
described how to code each item on the coding sheet were developed and
used. The three major categories of variables that were coded for
included 1) study characteristics; 2) subject characteristics; and 3)
outcome data. All IPD were abstracted and checked for accuracy by the
first author. Blinding of the coder to study information (identity and
institutional affiliation of authors, study results) was not performed
because it has recently been shown that these procedures have neither
a statistically significant nor a clinically important effect on the
results.7

STATISTICAL ANALYSIS
For IPD, means±standard deviations (SD) were used to describe
continuous variables, while frequencies and percentages were used for
categorical variables. For continuous variables, independent t tests
were used to examine differences in initial characteristics between
exercise and control groups, while chi-square tests were used for
categorical variables. In order to examine initial and final values
for BMD between exercise and control groups, a two-way analysis of
variance (ANOVA) with repeated measures on one factor was conducted.
The within-subjects or trial factor was assessment of BMD (initial and
final), while the between-subjects or grouping factor was group
assignment (exercise or control). Since this was an unbalanced design,
a general linear model was employed. Prior to conducting the ANOVA,
Pearson product moment correlations were used to examine for potential
covariates to enter into the ANOVA model. This was accomplished by
examining the association between changes in femoral neck BMD and the
following variables: age, height, initial body weight, changes in body
weight, initial body mass index (kg/m²), changes in body mass index
(kg/m²), percent body fat, changes in percent body fat, initial lean
body mass, changes in lean body mass, initial maximum oxygen
consumption (mL·kg⁻¹·min⁻¹), changes in maximum oxygen consumption
(mL·kg⁻¹·min⁻¹), menarche, years postmenopause, cigarette smoking,
alcohol consumption, calcium and vitamin D intake, compliance
(percentage of exercise sessions attended), length of training in
weeks, type of exercise intervention (weight-bearing,
non-weight-bearing, weight training), type of BMD assessment
(dual-energy x-ray absorptiometry, dual-photon absorptiometry), and
study design (randomized vs. nonrandomized controlled trial). However,
since only initial BMD was significantly associated with changes in
BMD (r=0.H7; p=0.011) and this variable was already in the ANOVA
model, no covariates were entered into the analysis. The alpha level
for statistical significance for both the Pearson product moment
correlations and ANOVA was set at p≤0.05.

Because we were not able to retrieve IPD from all eligible studies, we
also examined whether our results differed between studies in which
IPD were available and those in which they were not. In order to
accomplish this, we used the standardized difference effect size,
calculated from the summary data reported in the studies.8 This was
calculated by subtracting the change outcome in the exercise group
from the change outcome in the control group, and then dividing by the
pooled standard deviation of the exercise and control groups.9 The
effect size was then corrected for small-sample bias.9 If means and
standard deviations were missing, the effect size was calculated from
reported test statistics.8 In general, an effect size of 0.20 is
considered a small effect, 0.50 a moderate effect, and 0.80 a large
effect.10 An effect size of 0.50, for example, means that the exercise
group differed from the control group by five tenths of a standard
deviation in favor of the exercise group. For studies that included
more than one group, an effect size was calculated for each group
independently of the other. We used the standardized difference effect
size vs. the original metric (BMD in g/cm²) so that all eligible
studies could be included in the analysis. We then compared effect
size differences between those studies in which IPD were provided and
those in which IPD were not provided, using an ANOVA-like random
effects model developed for meta-analytic research.9 This was
accomplished by examining the between- (Qb) and within-group (Qw)
differences for the effect sizes and their variances from each group.
The alpha level for statistical significance was set at p≤0.05.
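
The standardized difference effect size described above can be
sketched in code. This is a minimal illustration under stated
assumptions, not the authors' actual computation: the function name
and the input values are hypothetical, the sign convention (exercise
change minus control change) is chosen so that a positive value favors
exercise, as in the 0.50 example above, and the small-sample
correction is the commonly used approximation 1 - 3/(4*df - 1).

```python
import math


def standardized_difference(change_ex, sd_ex, n_ex, change_ctl, sd_ctl, n_ctl):
    """Bias-corrected standardized difference between two change scores.

    Assumption: positive values favor the exercise group.
    """
    df = n_ex + n_ctl - 2
    # Pooled standard deviation of the exercise and control groups.
    pooled_sd = math.sqrt(
        ((n_ex - 1) * sd_ex ** 2 + (n_ctl - 1) * sd_ctl ** 2) / df
    )
    d = (change_ex - change_ctl) / pooled_sd
    # Approximate small-sample bias correction factor.
    correction = 1 - 3 / (4 * df - 1)
    return d * correction


# Hypothetical summary data: change in BMD (g/cm^2), SD, and group size.
g = standardized_difference(1.0, 1.0, 10, 0.0, 1.0, 10)
print(round(g, 4))
```

With equal SDs of 1.0 the uncorrected difference is 1.0, so the
printed value shows only the shrinkage applied by the small-sample
correction for two groups of 10.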

RESULTS

Study Characteristics

Twenty-three studies from 25 publications met our criteria for
inclusion. The number of publications exceeded the number of studies
because we included two studies that appeared in the form of both a
dissertation and a refereed journal article. From these, IPD were
available from 10 studies and 11 publications. Thus, we were able to
include IPD from approximately 43% of eligible studies. A general
description of the studies is shown in Table I. Five of the studies
were randomized controlled trials and five were controlled trials.

The 10 studies included a total of 22 groups (12 exercise, 10 control)
and 595 subjects (295 exercise, 300 control) that met our criteria for
inclusion. The subjects' ages ranged from 42-92 years in the exercise
groups and 46-86 years in the control groups. The length of training
for the studies ranged from 32-104 weeks (mean±SD, 58±33 weeks). Ten
groups performed weight-bearing exercise, two performed
non-weight-bearing exercise, and two others performed weight training
exercise. Six of the studies used dual-energy x-ray absorptiometry to
assess BMD at the femoral neck while the other four used dual-photon
absorptiometry.


Subject Characteristics

Initial characteristics of the subjects for continuous variables are
shown in Table II. As can be seen, there were no statistically
significant differences between exercise and control subjects in
relation to age, height, body weight, and vitamin D intake. However,
subjects in the exercise groups had significantly higher levels of
calcium intake, while the number of years postmenopause was greater in
the controls. Initial characteristics of the subjects for categorical
variables are shown in Table III. No statistically significant
differences were found between exercise and control groups in relation
to cigarette smoking, alcohol consumption, use of estrogen

Table I. Characteristics of Femoral Neck Bone Mineral Density Studies
(g/cm²) in Which IPD Were Provided or Available

Study: Bloomfield et al.
Design/Subjects: CT that included 17 postmenopausal women assigned to
either an exercise (n=7; age=61.9±2.3 years) or a control (n=10;
age=59.6±9.7 years) group
Exercise Intervention(s): 32 weeks of training performed 3x per week
for 50 minutes per session (15-minute warm-up, 30 minutes stationary
cycling, 5-minute cool-down) at 60%-80% of MHR
BMD Assessment: DPA (Lunar DP3)

Study: Bravo et al.
Design/Subjects: RCT that included 106 osteopenic women assigned to
either an exercise (n=44; age=59.8±5.9 years) or a control (n=62;
age=60.0±5.3 years) group
Exercise Intervention(s): 52 weeks of training performed 3x per week
for 60-65 minutes per session. Exercise sessions consisted of a
10-minute warm-up, 25 minutes of rapid walking replaced with aerobic
dance 1x per week, and 15 minutes of bench stepping at 60%-70% of MHR.
This was followed by 10-15 minutes of resistance exercise.
BMD Assessment: DEXA (Lunar DPX)

Study: Brooke-Wavell et al.
Design/Subjects: RCT that included 77 postmenopausal women assigned to
either an exercise (n=38; age=64.9±3.0 years) or a control (n=39;
age=6?.1±3.1 years) group
Exercise Intervention(s): 52 weeks of training that consisted of
self-monitored walking 3.5x per week for 14.8 minutes per day for the
first 12 weeks, followed by 20.4 minutes per day of walking, 4.8 days
per week, for the remainder of the study.
BMD Assessment: DEXA (Lunar DPX)

Study: Caplan & Ward
Design/Subjects: CT that included 30 postmenopausal women assigned to
either an exercise (n=19; age=66.4±5.1 years) or a control (n=11;
age=63.4±5.0 years) group
Exercise Intervention(s): 104 weeks of aerobic weight-bearing exercise
performed 2x per week for 60 minutes (warm-up, 20-25 minutes of
low-impact aerobic exercise, 10 minutes of ball games for improved
hand-eye coordination followed by work on floor mats for strength and
flexibility, 10 minutes of relaxation). Subjects were also asked to
exercise on their own 1x per week so that the pulse would be elevated
for at least 20-30 minutes.
BMD Assessment: DPA (Lunar DPA)

Study: Ebrahim et al.
Design/Subjects: RCT that included 91 postmenopausal women assigned to
either an exercise (n=44; age=66.?±8.1 years) or a control (n=47;
age=?8.2±7.9 years) group
Exercise Intervention(s): 104 weeks of walking 3x per week for 40
minutes per session.
BMD Assessment: DEXA (Lunar DPX)

Study: Little
Design/Subjects: CT that included 21 postmenopausal women assigned to
a resistance training (n=6; age=59.3±2.4 years), walking (n=6;
age=52.2±4.3 years), swimming (n=5; age=51.4±5.8 years), or control
(n=4; age=60.5±?.? years) group.
Exercise Intervention(s): Resistance training consisted of 32 weeks of
training consisting of 9 exercises performed 3x per week for 1 set of
8-12 repetitions at 60%-80% of 1RM; walking consisted of 32 weeks of
training, 3x per week for 30-50 minutes per session (5-10 minute
warm-up, walking for 20-30 minutes, 5-10 minute cool-down) at 70%-90%
of MHR; swimming consisted of 32 weeks of training, 3x per week for
30-50 minutes per session (5-10 minute warm-up, walking for 20-30
minutes, 5-10 minute cool-down) at 70%-90% of MHR 13 beats per minute.
BMD Assessment: DPA (Lunar)

Study: (name illegible) et al.
Design/Subjects: RCT that included 136 subjects assigned to either an
exercise (n=66; age=70.9±5.0 years) or a control (n=70; age=71.0±3.0
years) group
Exercise Intervention(s): 42 weeks of exercise performed 2x per week
for approximately 60 minutes per session (5-minute warm-up, 35-40
minutes of aerobic exercises consisting of activities for balance,
hand-eye and foot-eye coordination, and strengthening exercises, 15
minutes of stretching, and 5-10 minute cool-down).
BMD Assessment: DEXA (Lunar DPX)

Study: Prince et al.
Design/Subjects: RCT that included 61 postmenopausal women assigned to
either a calcium and exercise (n=28; age=63.6±4.3 years) or a calcium
(n=33; age=62.4±4.8 years) group
Exercise Intervention(s): 104 weeks of weight-bearing exercise
performed 2x per week for approximately 60 minutes per session.
Subjects were also asked to walk another 2 hours per week at 60% of
peak heart rate for their age.
BMD Assessment: DEXA (QDR-1000)

Study: (name illegible)
Design/Subjects: CT that included 26 postmenopausal women assigned to
either an exercise (n=17; age=5?.6±4.1 years) or a control (n=9;
age=55.6±2.9 years) group
Exercise Intervention(s): 36 weeks of strength training consisting of
13 exercises performed 3x per week at 50%-60% of 1RM for 1 set of
10-12 repetitions for the upper body and 10-15 repetitions for the
lower body.
BMD Assessment: DPA (Lunar DP3)

Study: (name illegible) et al.
Design/Subjects: CT that included 30 postmenopausal women assigned to
either a weight loss (n=15; age=62.5±5.? years) or an exercise +
weight loss (n=15; age=63.4±?.7 years) group
Exercise Intervention(s): 24 weeks of aerobic exercise (treadmill
jogging) performed 3x per week for up to 35 minutes per session at 50%
to >70% of VO2max. Each session included 10-minute warm-up and
cool-down periods.
BMD Assessment: DEXA (Lunar)

Numbers of subjects are limited to those for whom valid IPD
(individual patient data) were available. Groups were limited to those
that met our inclusion criteria.
BMD=bone mineral density; CT=controlled trial; RCT=randomized
controlled trial; DEXA=dual-energy x-ray absorptiometry;
DPA=dual-photon absorptiometry; MHR=maximal heart rate;
VO2max=maximal oxygen consumption; 1RM=one-repetition maximum

and/or progesterone, previous fractures at any site, and race. The
majority of subjects were nonsmokers, not taking any type of estrogen
and/or progesterone, and were white. A little more than one half of
the subjects reported consuming alcohol and/or experiencing a previous
fracture at one or more sites in the body.

Outcomes

Femoral neck BMD values are shown in Table IV. Across all designs and
categories, there was an increase of 0.73%±5.52% in the exercise
subjects and 0.45%±6.78% in the control subjects. Comparison of
initial and final BMD values for potential differences is shown in
Table V. Across all studies and subjects, statistically significant
main effects differences were observed for groups but not trials.
However, there was no statistically significant interaction between
groups and trials, indicating that exercise did not have any effect on
femoral neck BMD. In addition, no statistically significant
interaction was observed when results were analyzed with each study
and its subjects deleted from the model. Furthermore, no statistically
significant within- or between-group differences in femoral neck BMD
were observed when we compared effect size results for studies in
which IPD were available with those in which they were not (mean±SD,
IPD available, 0.098±0.352; IPD not available, 0.164±0.416; Qw,
30.725, p=0.429; Qb, 0.242, p=0.623).

DISCUSSION

The results of this study suggest that exercise does not improve
femoral neck BMD in postmenopausal women. In addition, we found no
differences between the results of those studies in which IPD were
available and those in which IPD were not available. Furthermore, our
results are consistent with the majority of findings of the original
studies, in that 79% of the outcomes at the femoral neck were reported
as not being statistically significant. In contrast, our findings
appear to be somewhat different from the recent position statement
from the National Institutes of Health Consensus Development Panel on
Osteoporosis, Prevention, Diagnosis, and Therapy, which suggested that
exercise during the later years probably has a modest effect on
slowing the decline in BMD.4 However, this was a broad statement and
not specific to any one site in the body. Our findings also conflict
with our previous meta-analytic work, in which an approximate 2%
improvement in BMD was found at the hip as a result of site-specific
aerobic exercise and progressive resistance training.5,6 One of the
possible reasons for the discrepant results between our current and
previous meta-analytic work may have to do


Table II. Initial Characteristics of Subjects for Continuous Variables

                              Exercise                 Control
Variable                  N    Mean±SD            N    Mean±SD            t (p)
Age (years)              278   64.90±7.11        291   65.13±7.12        -0.39 (0.70)
Height (cm)              277   159.01±6.69       288   158.42±7.08        1.02 (0.31)
Weight (kg)              278   65.54±11.28       286   64.77±12.26        0.77 (0.44)
Postmenopause (years)    133   14.37±9.97        127   18.60±10.67       -3.31 (0.001)*
Calcium (mg)             181   946.16±393.60     175   865.43±356.35      2.03 (0.04)*
Vitamin D (IU)            57   198±212            62   161±133            1.17 (0.24)

N=number of subjects for whom data were available; BMD=bone mineral
density; t=independent t test value; p=probability value;
IU=international units; *statistically significant
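
As a rough check on Table II, the independent t test values can be
approximately recomputed from the reported N, mean, and SD. The sketch
below assumes a pooled-variance (equal variances) t test, which the
text does not specify, and the function name is hypothetical; small
discrepancies from the published values are expected because the
inputs are rounded.

```python
import math


def pooled_t(n1, mean1, sd1, n2, mean2, sd2):
    """Independent-samples t statistic from summary data (equal variances assumed)."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error, df


# Exercise vs. control rows as read from Table II.
t_age, df_age = pooled_t(278, 64.90, 7.11, 291, 65.13, 7.12)              # reported t: -0.39
t_calcium, df_calcium = pooled_t(181, 946.16, 393.60, 175, 865.43, 356.35)  # reported t: 2.03

print(f"age: t={t_age:.2f}, df={df_age}")
print(f"calcium: t={t_calcium:.2f}, df={df_calcium}")
```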

Table III. Initial Characteristics of Subjects for Categorical Variables

                                  Exercise      Control       Chi-Square
Variable                          N (%)         N (%)         (p)
Cigarette smoking                 19 (9.0)      25 (11.4)     0.71 (0.40)
Alcohol consumption               122 (59.5)    111 (52.9)    1.87 (0.17)
Estrogen/progesterone use         20 (6.9)      17 (5.9)      0.27 (0.60)
Previous fractures (any site)     58 (52.2)     63 (57.8)     0.68 (0.41)
Race (white)                      213 (99.5)    214 (98.6)    0.98 (0.63)

Results are limited to studies that reported data for each variable.

with the fact that the summary measures obtained in our previous
research were the result of pooling the outcomes from all sites
assessed at the femur (femoral neck, Ward's triangle, trochanter,
intertrochanter). Consequently, it may be that improvements in BMD
occur at one or more sites other than the femoral neck. However, we
were unable to address the effect of exercise on the other BMD sites
at the femur in this investigation.

The recent consensus statement from the National Institutes of Health
suggested that higher-impact activities and resistance training may
have the greatest effect on BMD.4 However, as can be seen in Table I,
the majority of studies included in this investigation used lower-
rather than higher-impact types of activities, primarily walking, as
an intervention. In addition, while exercises designed to strengthen
the hip were used in studies that employed a progressive resistance
training protocol, the majority of exercises focused on movements
designed to strengthen the upper body. Thus, it may be that the lack
of improvement in femoral neck BMD in this investigation was the
result of the exercise protocols employed. However, while
higher-impact activities, such as jumping and high-impact aerobic
dance, may be more beneficial to femoral neck BMD, this must be
countered with issues of adherence to a regular program of exercise as
well as the potential to put the subject at an increased risk for
injury, particularly stress fractures and osteoarthritis. Thus, from a
practical standpoint, the lower-impact exercise protocols that were
employed in many of our included studies are probably the most
appropriate. This may be especially true for walking, since it is the
most common type of exercise in which people in the United States
participate.

Since the terms lower- and higher-impact are broad and fairly
subjective, it would appear plausible to suggest that future studies
examining the effects of exercise on BMD make some attempt to quantify
the forces involved for the interventions employed. For those studies
that employ a progressive resistance training protocol, additional
lower leg exercises that may affect femoral neck BMD should be
employed. Incorporation of the above suggestions should result in a
better understanding regarding the efficacy of exercise for improving
BMD at the femoral neck.

Despite the fact that exercise did not have any effect on femoral neck
BMD, such activities should almost always be recommended. For example,
while exercise may not improve femoral neck BMD, it may increase
muscular strength and balance and improve postural stability, thus
reducing the risk of falling and the subsequent fractures that can
result from falling. While it is important for future research to
examine the efficacy and effectiveness of various exercise
interventions on femoral neck BMD, it would appear reasonable to
suggest that a need exists for increased research that addresses the
effects of exercise for preventing osteoporotic fractures in the
presence and/or absence of changes in BMD.

While  use  of  the  meta-analytic  approach  pro¬ 
vides  for  a  more  objective  evaluation  of  studies  rcl- 


Table IV. Femoral Neck Bone Mineral Density Results

                    Initial         Final           Difference
Group         N     Mean±SD         Mean±SD         Mean±SD
Exercise     295    0.787±0.123     0.793±0.124     0.004±0.039
Control      300    0.763±0.122     0.764±0.137     0.003±0.048

Table V. ANOVA Summary Table for Femoral Neck BMD (General Linear Model)

Source of Variation               DF      SS          MS          F Ratio    p Value
Group (exercise and control)      1       0.193       0.193       6.730      0.010*
Subjects (group)                  593     16.991      0.0287      -          -
Trial (initial and final BMD)     1       0.00217     0.00217     2.277      0.132
Group×Trial                       1       0.000816    0.000816    0.857      0.355
Residual                          593     0.565       0.000952    -          -
Total                             1189    17.752      0.0149      -          -

ANOVA=analysis of variance; BMD=bone mineral density; DF=degrees of
freedom; SS=sum of squares; MS=mean square; p=probability value;
*statistically significant
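
The F ratios in Table V can be reproduced, to rounding, from the mean
squares. The sketch below assumes the usual error terms for a two-way
design with repeated measures on one factor: subjects within group for
the Group effect, and the residual for Trial and Group×Trial. The
dictionary keys are illustrative names, not taken from the source.

```python
# Mean squares as read from Table V (rounded as published).
mean_squares = {
    "group": 0.193,
    "subjects_within_group": 0.0287,
    "trial": 0.00217,
    "group_x_trial": 0.000816,
    "residual": 0.000952,
}

# Between-subjects effect is tested against subjects within group.
f_group = mean_squares["group"] / mean_squares["subjects_within_group"]
# Within-subjects effects are tested against the residual.
f_trial = mean_squares["trial"] / mean_squares["residual"]
f_interaction = mean_squares["group_x_trial"] / mean_squares["residual"]

print(f"Group: F={f_group:.3f} (reported 6.730)")
print(f"Trial: F={f_trial:.3f} (reported 2.277)")
print(f"Group x Trial: F={f_interaction:.3f} (reported 0.857)")
```

The Group×Trial ratio is the one that bears on the study question: a
value below 1 means the exercise and control groups changed by about
the same amount between the initial and final assessments.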